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PREFACE 


Reports  in  this  volume  are  numbered  consecutively  beginning  with  number  1 .  Each  report  is 
paginated  with  the  report  number  followed  by  consecutive  page  numbers,  e.g.,  1-1,  1-2,  1-3;  2-1,  2-2, 


2-3. 


Due  to  its  length,  Volume  5  is  bound  in  three  parts,  5  A,  5B  and  5C.  Volume  5A  contains 
#1-24.  Volume  5B  contains  reports  #25-48  and  5C  contains  #49-70.  The  Table  of  Contents  for 
Volume  5  is  included  in  all  parts. 

This  document  is  one  of  a  set  of  16  volumes  describing  the  1996  AFOSR  Summer  Research 
Program.  The  following  volumes  comprise  the  set: 
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INTRODUCTION 


The  Summer  Research  Program  (SRP),  sponsored  by  the  Air  Force  Office  of  Scientific 
Research  (AFOSR),  offers  paid  opportunities  for  university  faculty,  graduate  students,  and  high 
school  students  to  conduct  research  in  U.S.  Air  Force  research  laboratories  nationwide  during 
the  summer. 

Introduced  by  AFOSR  in  1978,  this  innovative  program  is  based  on  the  concept  of  teaming 
araHpmir  researchers  with  Air  Force  scientists  in  the  same  disciplines  using  laboratory  facilities 
and  equipment  not  often  available  at  associates'  institutions. 

The  Summer  Faculty  Research  Program  (SFRP)  is  open  annually  to  approximately  150  faculty 
members  with  at  least  two  years  of  teaching  and/or  research  experience  in  accredited  U.S. 
colleges,  universities,  or  technical  institutions.  SFRP  associates  must  be  either  U.S.  citizens  or 
permanent  residents. 

Die  Graduate  Student  Research  Program  (GSRP)  is  open  annually  to  approximately  100 
graduate  students  holding  a  bachelor's  or  a  master's  degree;  GSRP  associates  must  be  U.S. 
citizens  enrolled  full  time  at  an  accredited  institution. 

Die  High  School  Apprentice  Program  (HSAP)  annually  selects  about  125  high  school  students 
located  within  a  twenty  mile  commuting  distance  of  participating  Air  Force  laboratories. 

AFOSR  also  offers  its  research  associates  an  opportunity,  under  the  Summer  Research 
Extension  Program  (SREP),  to  continue  their  AFOSR-sponsored  research  at  their  home 
institutions  through  the  award  of  research  grants.  In  1994  the  maximum  amount  of  each  grant 
was  increased  from  $20,000  to  $25,000,  and  the  number  of  AFOSR-sponsored  grants 
decreased  from  75  to  60.  A  separate  annual  report  is  compiled  on  the  SREP. 

The  numbers  of  projected  summer  research  participants  in  each  of  the  three  categories  and 
SREP  “grants”  are  usually  increased  through  direct  sponsorship  by  participating  laboratories. 

AFOSR' s  SRP  has  well  served  its  objectives  of  building  critical  links  between  Air  Force 
research  laboratories  and  the  academic  community,  opening  avenues  of  communications  and 
forging  new  research  relationships  between  Air  Force  and  academic  technical  experts  in  areas  of 
national  interest,  and  strengthening  the  nation's  efforts  to  sustain  careers  in  science  and 
engineering.  Die  success  of  the  SRP  can  be  gauged  from  its  growth  from  inception  (see  Table 
1)  and  from  the  favorable  responses  the  1996  participants  expressed  in  end-of-tour  SRP 
evaluations  (Appendix  B). 

AFOSR  contracts  for  administration  of  the  SRP  by  civilian  contractors.  The  contract  was  first 
awarded  to  Research  &  Development  Laboratories  (RDL)  in  September  1990.  After 
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completion  of  the  1990  contract,  RDL  (in  1993)  won  the  recompetition  for  the  basic  year  and 
four  1-year  options. 


2.  PARTICIPATION  IN  THE  SUMMER  RESEARCH  PROGRAM 


The  SRP  began  with  faculty  associates  in  1979;  graduate  students  were  added  in  1982  and  high 
school  students  in  1986.  The  following  table  shows  the  number  of  associates  in  the  program 
each  year. 


YEAR 

SRP  Participation,  by  Year 

TOTAL  j 

SFRP 

GSRP 

HSAP 

1979 

70 

70 

1980 

87 

87 

1981 

87 

87 

1982 

91 

17 

108 

1983 

101 

53 

154  1 

1984 

152 

84 

236 

1985 

154 

92 

246 

1986 

158 

100 

42 

300 

1987 

159 

101 

73 

333 

1988 

153 

107 

101 

361 

1989 

168 

102 

103 

373 

1990 

165 

121 

132 

418 

1991 

170 

142 

132 

444 

1992 

185 

121 

159 

464 

1993 

187 

117 

136 

440 

1994 

192 

117 

133 

442 

1995 

190 

115 

137 

442 

|  1996 

188 

109 

138 

435 
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Beginning  in  1993,  due  to  budget  cuts,  some  of  the  laboratories  weren’t  able  to  afford  to  fund 
as  many  associates  as  in  previous  years.  Since  then,  the  number  of  funded  positions  has 
remained  fairly  constant  at  a  slightly  lower  level. 


3.  RECRUITING  AND  SELECTION 

The  SRP  is  conducted  on  a  nationally  advertised  and  competitive-selection  basis.  The 
advertising  for  faculty  and  graduate  students  consisted  primarily  of  the  mailing  of  8,000  52- 
page  SRP  brochures  to  chairpersons  of  departments  relevant  to  AFOSR  research  and  to 
administrators  of  grants  in  accredited  universities,  colleges,  and  technical  institutions. 
Historically  Black  Colleges  and  Universities  (HBCUs)  and  Minority  Institutions  (Mis)  were 
included.  Brochures  also  went  to  all  participating  USAF  laboratories,  the  previous  year's 
participants,  and  numerous  individual  requesters  (over  1000  annually). 

RDL  placed  advertisements  in  the  following  publications:  Black  Issues  in  Higher  Education, 
Winds  of  Change,  and  TREE  Spectrum.  Because  no  participants  list  either  Physics  Today  or 
Chemical  &  Engineering  News  as  being  their  source  of  learning  about  the  program  for  the  past 
several  years,  advertisements  in  these  magazines  were  dropped,  and  the  funds  were  used  to 
cover  increases  in  brochure  printing  costs. 

High  school  applicants  can  participate  only  in  laboratories  located  no  more  than  20  miles  from 
their  residence.  Tailored  brochures  on  the  HSAP  were  sent  to  the  head  counselors  of  180  high 
schools  in  the  vicinity  of  participating  laboratories,  with  instructions  for  publicizing  the  program 
in  their  schools.  High  school  students  selected  to  serve  at  Wright  Laboratory's  Armament 
Directorate  (Eglin  Air  Force  Base,  Florida)  serve  eleven  weeks  as  opposed  to  the  eight  weeks 
normally  worked  by  high  school  students  at  all  other  participating  laboratories. 

Each  SFRP  or  GSRP  applicant  is  given  a  first,  second,  and  third  choice  of  laboratory.  High 
school  students  who  have  more  than  one  laboratory  or  directorate  near  their  homes  are  also 
given  first,  second,  and  third  choices. 

Laboratories  make  their  selections  and  prioritize  their  nominees.  AFOSR  then  determines  the 
number  to  be  funded  at  each  laboratory  and  approves  laboratories'  selections. 

Subsequently,  laboratories  use  their  own  funds  to  sponsor  additional  candidates.  Some  selectees 
do  not  accept  the  appointment,  so  alternate  candidates  are  chosen.  This  multi-step  selection 
procedure  results  in  some  candidates  being  notified  of  their  acceptance  after  scheduled 
deadlines.  The  total  applicants  and  participants  for  1996  are  shown  in  this  table. 
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1996  Applicants  and  Participants  1 

PARTICIPANT 

TOTAL 

SELECTEES 

DECLINING 

CATEGORY 

APPLICANTS 

SELECTEES 

SFRP 

572 

188 

39 

(HBCU/MI) 

(119) 

(27) 

(5) 

GSRP 

235 

109 

7 

(HBCU/MI) 

(18) 

(7) 

(i)  ! 

HSAP 

474 

138 

8 

TOTAL 

1281 

435 

54 

4.  SITE  VLSITS 

During  June  and  July  of  1996,  representatives  of  both  AFOSR/NI  and  RDL  visited  each 
participating  laboratory  to  provide  briefings,  answer  questions,  and  resolve  problems  for  both 
laboratory  personnel  and  participants.  The  objective  was  to  ensure  that  the  SRP  would  be  as 
constructive  as  possible  for  all  participants.  Both  SRP  participants  and  RDL  representatives 
found  these  visits  beneficial.  At  many  of  the  laboratories,  this  was  the  only  opportunity  for  all 
participants  to  meet  at  one  time  to  share  their  experiences  and  exchange  ideas. 


5.  HISTORICALLY  BLACK  COLLEGES  AND  UNIVERSITIES  AND  MINORITY 
INSTITUTIONS  (HBCU/MLs) 

Before  1993,  an  RDL  program  representative  visited  from  seven  to  ten  different  HBCU/Mis 
annually  to  promote  interest  in  the  SRP  among  the  faculty  and  graduate  students.  These  efforts 
were  marginally  effective,  yielding  a  doubling  of  HBCI/MI  applicants.  In  an  effort  to  achieve 
AFOSR’s  goal  of  10%  of  all  applicants  and  selectees  being  HBCU/MI  qualified,  the  RDL  team 
decided  to  try  other  avenues  of  approach  to  increase  the  number  of  qualified  applicants. 
Through  the  combined  efforts  of  the  AFOSR  Program  Office  at  Bolling  AFB  and  RDL,  two 
very  active  minority  groups  were  found,  HACU  (Hispanic  American  Colleges  and  Universities) 
and  AISES  (American  Indian  Science  and  Engineering  Society).  RDL  is  in  communication 
with  representatives  of  each  of  these  organizations  on  a  monthly  basis  to  keep  up  with  the  their 
activities  and  special  events.  Both  organizations  have  widely-distributed  magazines/quarterlies 
in  which  RDL  placed  ads. 

Since  1994  the  number  of  both  SFRP  and  GSRP  HBCU/MI  applicants  and  participants  has 
increased  ten-fold,  from  about  two  dozen  SFRP  applicants  and  a  half  dozen  selectees  to  over 
100  applicants  and  two  dozen  selectees,  and  a  half-dozen  GSRP  applicants  and  two  or  three 
selectees  to  18  applicants  and  7  or  8  selectees.  Since  1993,  the  SFRP  had  a  two-fold  applicant 
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increase  and  a  two-fold  selectee  increase.  Since  1993,  the  GSRP  had  a  three-fold  applicant 
increase  and  a  three  to  four-fold  increase  in  selectees. 

In  addition  to  RDL's  special  recruiting  efforts,  AFOSR  attempts  each  year  to  obtain  additional 
funding  or  use  leftover  funding  from  cancellations  the  past  year  to  fund  HBCU/MI  associates. 
This  year,  5  HBCU/MI  SFRPs  declined  after  they  were  selected  (and  there  was  no  one 
qualified  to  replace  them  with).  The  following  table  records  HBCU/MI  participation  in  this 

program. 


SRP  HBCU/MI  Participation,  By  Year 

YEAR 

SFRP 

O 

IP 

Applicants 

Participants 

Applicants 

Participants 

1985 

76 

23 

15 

11 

1986 

70 

18 

20 

10 

1987 

82 

32 

32 

10 

1988 

53 

17 

23 

14 

1989 

39 

15 

13 

4 

1990 

43 

14 

17 

3 

- — - 

1991 

42 

13 

8 

5 

1992 

70 

13 

9 

5 

1993 

60 

13 

6 

2 

1994 

90 

16 

11 

6 

1995 

90 

21 

20 

8 

1996 

119 

27 

t 

00 

7 

6.  SRP  FUNDING  SOURCES 

Funding  sources  for  the  1996  SRP  were  the  AFOSR-provided  slots  for  the  basic  contract  and 
laboratory  funds.  Funding  sources  by  category  for  the  1996  SRP  selected  participants  are 

shown  here. 
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1996  SRP  FUNDING  CATEGORY 

SFRP 

GSRP 

HSAP 

AFOSR  Basic  Allocation  Funds 

141 

85 

123 

USAF  Laboratory  Funds 

37 

19 

15 

HBCU/MI  By  AFOSR 
(Using  Procured  Addn’l  Funds) 

10 

5 

0 

TOTAL 

188 

109 

138 

SFRP  - 150  were  selected,  but  nine  canceled  too  late  to  be  replaced. 

GSRP  -  90  were  selected,  but  five  canceled  too  late  to  be  replaced  (10  allocations  for 
the  ALCs  were  withheld  by  AFOSR) 

HSAP  - 125  were  selected,  but  two  canceled  too  late  to  be  replaced. 


7.  COMPENSATION  FOR  PARTICIPANTS 


Compensation  for  SRP  participants,  per  five-day  work  week,  is  shown  in  this  table. 


1996  SR] 

?  Associate  Compensation 

PARTICIPANT  CATEGORY 

1991 

1992 

1993 

1994 

1995 

- 

1996 

Faculty  Members 

$690 

$718 

$740 

$740 

$740 

$770 

Graduate  Student 
(Master's  Degree) 

$425 

$442 

$455 

$455 

$455 

$470 

Graduate  Student 
(Bachelor's  Degree) 

$365 

$380 

$391 

$391 

$391 

$400 

High  School  Student 
(First  Year) 

$200 

$200 

$200 

$200 

$200 

$200  j 

High  School  Student 
(Subsequent  Years) 

$240 

$240 

$240 

$240 

$240 

$240  1 

The  program  also  offered  associates  whose  homes  were  more  than  50  miles  from  the  laboratory 
an  expense  allowance  (seven  days  per  week)  of  $50/day  for  faculty  and  $40/day  for  graduate 
students.  Transportation  to  the  laboratory  at  the  beginning  of  their  tour  and  back  to  their  home 
destinations  at  the  end  was  also  reimbursed  for  these  participants.  Of  the  combined  SFRP  and 
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GSRP  associates,  65  %  (194  out  of  297)  claimed  travel  reimbursements  at  an  average  round- 
trip  cost  of  $780. 

Faculty  members  were  encouraged  to  visit  their  laboratories  before  their  summer  tour  began. 
All  costs  of  these  orientation  visits  were  reimbursed.  Forty-five  percent  (85  out  of  188)  of 
faculty  associates  took  orientation  trips  at  an  average  cost  of  $444.  By  contrast,  in  1993,  58  % 
of  SFRP  associates  took  orientation  visits  at  an  average  cost  of  $685;  that  was  the  highest 
percentage  of  associates  opting  to  take  an  orientation  trip  since  RDL  has  administered  the  SRP, 
and  the  highest  average  cost  of  an  orientation  trip.  These  1993  numbers  are  included  to  show 
the  fluctuation  which  can  occur  in  these  numbers  for  planning  purposes. 

Program  participants  submitted  biweekly  vouchers  countersigned  by  their  laboratory  research 
focal  point,  and  RDL  issued  paychecks  so  as  to  arrive  in  associates'  hands  two  weeks  later. 

In  1996,  RDL  implemented  direct  deposit  as  a  payment  option  for  SFRP  and  GSRP  associates. 
There  were  some  growing  pains.  Of  the  128  associates  who  opted  for  direct  deposit,  17  did  not 
check  to  ensure  that  their  financial  institutions  could  support  direct  deposit  (and  they  couldn’t), 
and  eight  associates  never  did  provide  RDL  with  their  banks’  ABA  number  (direct  deposit  bank 
routing  number),  so  only  103  associates  actually  participated  in  the  direct  deposit  program.  The 
remaining  associates  received  their  stipend  and  expense  payments  via  checks  sent  in  the  US 
mail. 

HSAP  program  participants  were  considered  actual  RDL  employees,  and  their  respective  state 
and  federal  income  tax  and  Social  Security  were  withheld  from  their  paychecks.  By  the  nature 
of  their  independent  research,  SFRP  and  GSRP  program  participants  were  considered  to  be 
consultants  or  independent  contractors.  As  such,  SFRP  and  GSRP  associates  were  responsible 
for  their  own  income  taxes,  Social  Security,  and  insurance. 

8.  CONTENTS  OF  THE  1996  REPORT 

The  complete  set  of  reports  for  the  1996  SRP  includes  this  program  management  report 
(Volume  1)  augmented  by  fifteen  volumes  of  final  research  reports  by  the  1996  associates,  as 
indicated  below: 


1996  SRP  Final  Report  Volume  Assignments 


I  LABORATORY 

SFRP 

GSRP 

HSAP 

|  Armstrong 

2 

7 

12 

Phillips 

3 

8 

13 

Rome 

4 

9 

14 

Wright 

5A,  5B 

10 

15 

AEDC,  ALCs,  WHMC 

6 

11 

16 

7 


APPENDIX  A  -  PROGRAM  STATISTICAL  SUMMARY 

A.  Colleges/Universities  Represented 

Selected  SFRP  associates  represented  169  different  colleges,  universities,  and 
institutions,  GSRP  associates  represented  95  different  colleges,  universities,  and  institutions. 

B.  States  Represented 

SFRP  -Applicants  came  from  47  states  plus  Washington  D.C.  and  Puerto  Rico. 
Selectees  represent  44  states  plus  Puerto  Rico. 

GSRP  -  Applicants  came  from  44  states  and  Puerto  Rico.  Selectees  represent  32  states. 
HSAP  -  Applicants  came  from  thirteen  states.  Selectees  represent  nine  states. 


Total  Number  of  Participants 

SFRP 

188 

GSRP 

109 

HSAP 

138 

TOTAL 

435 

|  Degrees  Represented 

SFRP 

GSRP 

TOTAL  I 

|  Doctoral 

184 

1 

185 

Master’s 

4 

48 

52 

Bachelor's 

0 

60 

60 

TOTAL 

188 

109 

297 

A-l 


_  SFRP  Academic  Titles 

Assistant  Professor 
Associate  Professor 
Professor 
Instructor 


Chairman 


Visiting  Professor 


Visiting  Assoc.  Prof. 


Research  Associate 


TOTAL 


|  Source  of  Learning  About  the  SRP 

Categoiy 

Applicants 

Selectees 

Applied/participated  in  prior  years 

28% 

34% 

Colleague  familiar  with  SRP 

19% 

16% 

Brochure  mailed  to  institution 

23% 

17% 

Contact  with  Air  Force  laboratory 

17% 

23% 

IEEE  Spectrum 

2% 

1% 

biihe 

1% 

1% 

Other  source 

10% 

8% 

total 

100% 

100% 
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APPENDIX  B  -  SRP  EVALUATION  RESPONSES 

1.  OVERVIEW 

Evaluations  were  completed  and  returned  to  RDL  by  four  groups  at  the  completion  of  the  SRP. 
The  number  of  respondents  in  each  group  is  shown  below. 


Table  B-l.  Total  SRP  Evaluations  Received 


Evaluation  Group 

Responses 

SFRP  &  GSRPs 

275 

HSAPs 

113 

USAF  Laboratory  Focal  Points 

84 

USAF  Laboratory  HSAP  Mentors 

6 

All  groups  indicate  unanimous  enthusiasm  for  the  SRP  experience. 


The  summarized  recommendations  for  program  improvement  from  both  associates  and 
laboratory  personnel  are  listed  below: 


A.  Better  preparation  on  the  labs’  part  prior  to  associates'  arrival  (i.e. ,  office  space, 
compute-  assets,  clearly  defined  scope  of  work). 

B.  Faculty  Associates  suggest  higher  stipends  for  SFRP  associates. 

C.  Both  HSAP  Air  Force  laboratory  mentors  and  associates  would  like  the  summer 
tour  extended  from  the  current  8  weeks  to  either  10  or  11  weeks;  the  groups 
state  it  takes  4-6  weeks  just  to  get  high  school  students  up-to- speed  on  what’s 
going  on  at  laboratory.  (Note:  this  same  argument  was  used  to  raise  the  faculty 
and  graduate  student  participation  time  a  few  years  ago.) 
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2.  1996  USAF  LABORATORY  FOCAL  POINT  (LFP)  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  84  LFP  evaluations  received. 
1  •  LFP  evaluations  received  and  associate  preferences: 


LFP  Evaluation  Summary.  The  summarized  responses,  by  laboratory,  are  listed  on  the 

ollowing  page.  LFPs  were  asked  to  rate  the  following  questions  on  a  scale  from  1  (below 
average)  to  5  (above  average). 


2.  LFPs  involved  in  SRP  associate  application  evaluation  process: 

a.  Time  available  for  evaluation  of  applications: 

b.  Adequacy  of  applications  for  selection  process: 

3.  Value  of  orientation  trips: 

4.  Length  of  research  tour 

5  a.  Benefits  of  associate's  work  to  laboratory: 
b.  Benefits  of  associate's  work  to  Air  Force: 

6.  a.  Enhancement  of  research  qualifications  for  LFP  and  staff: 

b.  Enhancement  of  research  qualifications  for  SFRP  associate: 

c.  Enhancement  of  research  qualifications  for  GSRP  associate: 

7.  a.  Enhancement  of  knowledge  for  LFP  and  staff: 

b.  Enhancement  of  knowledge  for  SFRP  associate: 

c.  Enhancement  of  knowledge  for  GSRP  associate: 

8.  Value  of  Air  Force  and  university  links: 

9.  Potential  for  future  collaboration: 

10.  a.  Your  working  relationship  with  SFRP: 
b.  Your  working  relationship  with  GSRP: 

1 1 .  Expenditure  of  your  time  worthwhile: 

(Continued  on  next  page) 
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12.  Quality  of  program  literature  for  associate: 

13.  a.  Quality  of  RDL's  communications  with  you: 

b.  Quality  of  RDL's  communications  with  associates: 

14.  Overall  assessment  of  SRP: 


Table] 

B-3.  Laboratory  Focal  Point  Reponses  to  above  questions 

AEDC 

AL 

FJSRL 

PL 

RL 

WHMC 

WL 

#  Evals  Reev’d 

7 

1 

14 

5 

0 

46 

Question  # 

2 

- 

86  % 

0  % 

88  % 

80  % 

- 

85  % 

2a 

- 

4.3 

n/a 

3.8 

4.0 

- 

3.6 

2b 

- 

4.0 

n/a 

3.9 

4.5 

- 

4.1 

3 

- 

4.5 

n/a 

4.3 

4.3 

- 

3.7 

4 

- 

4.1 

4.0 

4.1 

4.2 

- 

3.9 

5a 

- 

4.3 

5.0 

4.3 

4.6 

- 

4.4 

5b 

- 

4.5 

n/a 

4.2 

4.6 

- 

4.3 

6a 

- 

4.5 

5.0 

4.0 

4.4 

- 

4.3 

6b 

- 

4.3 

n/a 

4.1 

5.0 

- 

4.4 

6c 

- 

3.7 

5.0 

3.5 

5.0 

- 

4.3 

7a 

- 

4.7 

5.0 

4.0 

4.4 

- 

4.3 

7b 

- 

4.3 

n/a 

4.2 

5.0 

- 

4.4 

7c 

- 

4.0 

5.0 

3.9 

5.0 

- 

4.3 

8 

- 

4.6 

4.0 

4.5 

4.6 

- 

4.3 

9 

- 

4.9 

5.0 

4.4 

4.8 

- 

4.2 

10a 

- 

5.0 

n/a 

4.6 

4.6 

- 

4.6 

10b 

- 

4.7 

5.0 

3.9 

5.0 

- 

4.4 

11 

- 

4.6 

5.0 

4.4 

4.8 

- 

4.4 

12 

- 

4.0 

4.0 

4.0 

4.2 

- 

3.8 

13a 

- 

3.2 

4.0 

3.5 

3.8 

- 

3.4 

13b 

- 

3.4 

4.0 

3.6 

4.5 

- 

3.6 

14 

- 

4.4 

5.0 

4.4 

4.8 

- 

4.4 
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3.  1996  SFRP  &  GSRP  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  257  SFRP/GSRP  evaluations  received. 

Associates  were  asked  to  rate  the  following  questions  on  a  scale  from  1  (below  average)  to  5 
(above  average)  -  by  Air  Force  base  results  and  over-all  results  of  the  1996  evaluations  are 
listed  after  the  questions. 

1.  The  match  between  the  laboratories  research  and  your  field: 

2.  Your  working  relationship  with  your  LFP: 

3.  Enhancement  of  your  academic  qualifications: 

4.  Enhancement  of  your  research  qualifications: 

5.  Lab  readiness  for  you:  LFP,  task,  plan: 

6.  Lab  readiness  for  you:  equipment,  supplies,  facilities: 

7.  Lab  resources: 

8.  Lab  research  and  administrative  support: 

9.  Adequacy  of  brochure  and  associate  handbook: 

10.  RDL  communications  with  you: 

11.  Overall  payment  procedures: 

12.  Overall  assessment  of  the  SRP: 

13.  a.  Would  you  apply  again? 

b.  Will  you  continue  this  or  related  research? 

14.  Was  length  of  your  tour  satisfactory? 

15.  Percentage  of  associates  who  experienced  difficulties  in  finding  housing: 

16.  Where  did  you  stay  during  your  SRP  tour? 


a. 

At  Home: 

b. 

With  Friend: 

c. 

On  Local  Economy: 

d. 

Base  Quarters: 

Value 

of  orientation  visit: 

a. 

Essential: 

b. 

Convenient: 

c. 

Not  Worth  Cost: 

d. 

Not  Used: 

SFRP  and  GSRP  associate’s  responses  are  listed  in  tabular  format  on  the  following  page. 
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Table  B-4.  1996  SFRP  &  GSRP  Associate  Responses  to  SRP  Evaluation 


Arnold  1 

Brooks 

Edvrardfl 

Egfin 

Griffis 

Hanscom 

Kdly 

Khthnd 

Lackland 

Robins  1 

Tyndall 

WPAFB  1 

itwnge 

# 

res 

48 

6 

14 

31 

19 

3 

32 

i 

10 

257 

i 

4.8 

ESI 

mm 

ESI 

mm 

mm 

EH 

mm 

nn 

m 

4.0 

mm 

mm 

2 

KM 

mm 

mm 

4.9 

mm 

4.7 

EDI 

mm 

El 

ESI 

4.6 

4.8 

ESI 

3 

mm 

4.4 

mm 

gfl 

4.3 

4.2 

4.3 

mm 

k&JK 

EM 

4.3 

ESI 

4 

4.3 

3.8 

EH 

ESI 

4.4 

4.3 

mm 

mm 

EX9 

kei 

ELI 

ESI 

5 

4.3 

3J 

4.8 

ESI 

4.5 

4J 

42 

M.*iH 

E 1 

3.9 

KSE 

ESI 

6 

4 3 

4.3 

K9 

ESI 

ESI 

4.5 

EH 

3.8 

Bn 

El 

3.8 

4.2 

_J2_l 

7 

mm 

ESI 

4.2 

4.8 

mm 

43 

4.3 

4.1 

Bn 

El 

4.3 

43 

ESI 

8 

WT9 

mm 

EM 

E9 

mm 

4.3 

EH 

mm 

5.0 

miM 

mm 

El 

mm 

9 

ESI 

w*m 

mm 

EH 

4.3 

mrm 

ESI 

43 

WEm 

KM 

mm 

EB 

bee 

10 

mm 

m m 

mm 

EH 

ESI 

4.1 

EH 

42 

mam 

KSE 

^31 

11 

mm 

wtm 

iffl 

4.1 

EH 

mm 

Bn 

ESI 

ESI 

ESI 

12 

4.3 

ESI 

ei 

4.9 

ESI 

mm 

KB 

EH! 

mm 

bib 

ESI 

Numbers  below  are 

percentages 

13a 

83 

90 

83 

93 

87 

75 

100 

81 

100 

100 

100 

86 

87 

13b 

100 

89 

83 

100 

94 

98 

100 

94 

100 

100 

100 

94 

93 

14 

83 

96 

100 

90 

87 

Km 

100 

92 

100 

100 

70 

84 

88 

15 

17 

6 

IK1H 

33 

20 

76 

33 

25 

0 

100 

20 

8 

39 

16a 

26 

17 

9 

38 

23 

33 

4 

- 

- 

- 

30 

16b 

100 

33 

40 

_ 

8 

- 

- 

- 

- 

36 

2 

16c 

41 

83 

40 

62 

69 

67 

96 

100 

100 

64 

68 

16d 

. 

. 

- 

- 

- 

- 

- 

- 

17a 

_ 

33 

100 

17 

mm 

14 

67 

39 

- 

50 

40 

31 

35 

17b 

_ 

21 

• 

17 

10 

14 

- 

24 

- 

20 

16 

16 

17c 

• 

. 

10 

7 

- 

- 

- 

- 

- 

2 

3 

17d 

100 

46 

- 

66 

30 

69 

33 

37 

100 

L_ 

40 

51 

46 
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4.  1996  USAF  LABORATORY  HSAP  MENTOR  EVALUATION  RESPONSES 

Not  enough  evaluations  received  (5  total)  from  Mentors  to  do  useful  summary. 
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5.  1996  HSAP  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  1 13  HSAP  evaluations  received. 

HSAP  apprentices  were  asked  to  rate  the  following  questions  on  a  scale  from 
1  (below  average)  to  5  (above  average) 

1 .  Your  influence  on  selection  of  topic/type  of  work. 

2.  Working  relationship  with  mentor,  other  lab  scientists. 

3.  Enhancement  of  your  academic  qualifications. 

4.  Technically  challenging  work. 

5.  Lab  readiness  for  you:  mentor,  task,  work  plan,  equipment. 

6.  Influence  on  your  career. 

7.  Increased  interest  in  math/science. 

8.  Lab  research  &  administrative  support. 

9.  Adequacy  of  RDL’s  Apprentice  Handbook  and  administrative  materials. 

10.  Responsiveness  of  RDL  communications. 

1 1 .  Overall  payment  procedures. 

12.  Overall  assessment  of  SRP  value  to  you. 

13.  Would  you  apply  again  next  year?  Yes  (92  %) 

14.  Will  you  pursue  future  studies  related  to  this  research?  Yes  (68  %) 

15.  Was  Tour  length  satisfactory?  Yes  (82  %) 


Arnold 

Brooks 

Edwards 

EgHn 

GrifBss 

Hanscom 

WPAFB 

Totals 

tt 

5 

19 

7 

15 

13 

2 

7 

5 

40 

113 

resp 

i 

2.8 

3.3 

3.4 

3.5 

3.4 

3.2 

3.6 

3.6 

3.4 

2 

4.4 

4.6 

4.5 

4.8 

4.6 

4.4 

4.0 

4.6 

4.6 

3 

4.0 

4.2 

4.1 

4.3 

4.5 

MEM 

4.3 

4.6 

4.4 

4.4 

4 

3.6 

3.9 

4.5 

4.2 

MM 

4.6 

3.8 

4.3 

4.2 

5 

4.4 

4.1 

4.1 

MEM 

3.9 

3.6 

3.9 

4.0 

6 

3.2 

3.6 

mXm 

3.8 

MM 

3.3 

3.8 

3.6 

3.7 

7 

4.1 

3.9 

3.9 

5.0 

3.6 

4.0 

3.9 

8 

4.1 

4.3 

4.0 

4.0 

4.3 

3.8 

4.3 

4.2 

9 

4.4 

3.6 

4.1 

4.1 

3.5 

4.0 

3.9 

■SI 

3.7 

3.8 

3.8 

4.1 

3.7 

4.1 

4.0 

3.9 

ma 

3.8 

3.8 

■a 

mm 

3.7 

3.9 

3.8 

MEM 

3.7 

2.6 

3.7 

3.8 

19 

4.9 

4.6 

4.6 

mEm 

4.6 

4.2 

4.3 

4.5 

Numbers  below  are  percentages 

13 

60% 

95% 

100% 

100% 

85% 

100% 

100% 

100% 

90% 

92% 

14 

20% 

80% 

71% 

80% 

54% 

100% 

71% 

80% 

65% 

68% 

15 

100% 

70% 

71% 

100% 

100% 

50% 

86% 

60% 

80% 

82% 
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Abstract 


Forward  looking  infrared  (FLIR)  detector  arrays  generally  produce  low  resolution  images 
because  the  FLIR  arrays  can  not  be  made  sufficiently  dense  to  yield  a  high  sampling  frequency 
using  the  current  technology.  Microscanning  is  an  effective  technique  for  reducing  aliasing  and 
increasing  resolution  in  images  produced  by  staring  infrared  imaging  systems,  which  involves 
recording  a  sequence  of  frames  through  subpixel  movements  of  the  field  of  view  on  the  detector 
array  and  then  interlacing  them  to  produce  a  high  resolution  image.  The  FLIR  system  is  usually 
mounted  on  a  moving  platform,  such  as  an  aircraft,  and  the  normal  vibrations  associated  with  the 
moving  platform  can  be  used  to  generate  shifts  in  the  FLIR  recorded  sequence  of  frames.  Since  a 
fixed  number  of  frames  is  required  for  a  given  level  of  microscanning,  and  the  shifts  are  random, 
some  of  the  acquired  frames  may  have  almost  similar  shifts  thus  making  them  unusable  for  high 
resolution  image  reconstruction.  In  this  paper,  we  utilize  a  modified  version  of  the  algorithm 
reported  in  Ref.  1  for  estimating  the  shifts  among  the  acquired  frames  and  then  utilize  a  k- 
nearest-neighbor  approach  for  estimating  the  above  mentioned  missing  frames  to  form  the  final 
high  resolution  image.  Blurring  by  the  detector  and  optics  of  the  imaging  system  limits  the 
increase  in  image  resolution  when  microscanning  is  attempted  at  sub-pixel  movements  of  less 
than  half  the  detector  width.  We  resolve  this  difficulty  by  the  application  of  Wiener  filter, 
designed  using  the  MTF  of  the  imaging  system,  to  the  microscanned  images.  Computer 
simulation  and  experimental  results  are  presented  to  verify  the  effectiveness  of  the  proposed 
technique.  This  technique  is  significantly  faster  than  the  alternate  techniques,  and  is  found  to  be 
especially  suitable  for  real  time  applications. 
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1  Introduction 


In  many  Forward  Looking  Infrared  (FLIR)  imaging  systems,  the  detector  spacing  in  the  focal 
plane  array  is  not  sufficiently  small  so  as  to  sample  a  band-limited  scene  at  the  Nyquist  rate, 
resulting  in  a  degraded  image  due  to  aliasing  artifacts.  The  construction  of  a  focal  plane  array 
with  smaller  and  more  closely  spaced  detector  elements  is  very  difficult  and  prohibitively 
expensive  due  to  fabrication  complexity  and  quantum  efficiency  problems.  Over  the  last  four 
years,  the  Electro-Optics  Branch  of  Wright  Laboratory  has  been  developing  a  microscan  imaging 
technique  to  increase  the  spatial  sampling  rate  of  existing  focal  plane  arrays  and  to  alleviate  the 
aliasing  effects  in  infrared  imagery2.  The  microscanning  process  uses  a  sequence  of  spatially 
undersampled  time  frames  of  a  scene  to  generate  the  high  resolution  image.  Each  frame  is 
subpixel  shifted  relative  to  each  other  frame  onto  a  set  grid  pattern.  The  sequence  of  frames  are 
then  interlaced  to  yield  the  high  resolution  image  which  represents  the  input  scene  effectively 
sampled  at  a  higher  spatial  frequency.  The  aforementioned  process  is  called  controlled 
microscanning  because  the  subpixel  shifts  between  the  temporal  image  frames  are  controlled  and 
therefore  known  a  priori.  In  this  paper,  we  consider  a  practical  scenario  where  an  imager  is 
mounted  on  a  moving  and/or  vibrating  platform,  such  as  an  aircraft.  Consequently,  controlled 
microscanning  can  not  be  used  for  such  applications.  Uncontrolled  microscanning  is  the  process 
where  the  shifts  for  each  frame  are  unknown  and  must  be  estimated  before  registering  onto  the 
high  resolution  grid  pattern.  The  shifts  for  each  frame  are  unknown  because  they  are  generated  by 
the  random  motion  and/or  vibration  of  the  imager  platform  and  not  by  a  microscan  mirror.  Using 
random  shifts  alone  eliminates  the  necessity  for  a  microscan  mirror  and  driver  system.  Since  a 
fixed  number  of  frames  is  required  for  a  given  level  of  microscanning,  and  the  shifts  are  random, 
some  of  the  acquired  frames  may  have  almost  similar  shifts  which  will  create  empty  bins  in  the 
high  resolution  image  reconstruction.  Thus,  the  key  factors  for  the  high  resolution  image 
reconstruction  are  the  accurate  knowledge  of  the  subpixel  translational  motion  of  the  scene 
relative  to  the  FLIR  array  and  the  accurate  estimation  of  the  missing  frames  (a  frame  whose 
translational  shifts  are  similar  to  another  frame  or  a  frame  which  does  not  exist  corresponding  to 
a  desired  grid  pattern).  In  this  paper,  we  have  utilized  a  modified  version  of  the  algorithm 
reported  in  Ref.  1  for  estimating  the  shifts  between  the  acquired  frames  and  then  used  a  k- 
nearest-neighbor  approach  for  estimating  the  aforementioned  missing  frames.  The  proposed 
algorithm  is  significantly  faster  than  alternate  techniques3'4  and  is  found  to  be  suitable  for  real 
time  applications. 

With  the  increase  of  microscan  level  (i.e.,  smaller  and  smaller  subpixel  movements),  we  expect 
to  see  a  proportional  increase  in  resolution  of  the  final  reconstructed  image.  However,  attempts  at 
increasing  the  resolution  beyond  the  Nyquist  rate  produce  images  having  the  same  general 
appearance  independent  of  microscan  level.  This  bottleneck  in  the  microscanning  process  was 
found  to  be  primarily  caused  by  the  blurring  inherent  to  the  system’s  modulation  transfer 
function  (MTF).  The  main  contributors  to  the  system  MTF  are  the  optical  transfer  function  and 
the  detector  transfer  function.  Since  all  parameters  of  the  imager  optics  and  detector  array  are 
known,  it  is  possible  to  accurately  model  the  system  MTF.  The  application  of  a  Wiener  filter 
was  found  to  be  an  effective  means  of  removing  the  blurring  caused  by  the  system  MTF  and 
improving  the  resolution  of  microscanned  image  data.  Computer  simulations  were  performed  by 
varying  the  level  of  microscanning  along  with  the  optical  transfer  function  and  detector  transfer 
function.  The  resulting  images  are  then  compared  with  those  images  produced  after  using  Wiener 
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filtering  to  deconvolve  the  optical  and  detector  blurring  functions  out  of  the  microscan  images. 
Note  that  the  Wiener  filter  restoration  is  done  in  the  discrete  domain  on  the  sensor’s  discrete 
output  image.  However,  the  blurring  of  an  image  by  the  optics  and  detectors  of  the  sensor  is  a 
continuous  process.  Therefore  the  continuous  blurring  function  must  be  mapped  to  a  discrete 
blurring  function  to  be  used  to  create  a  discrete  Wiener  filter  for  restoration.  To  avoid  aliasing  in 
the  continuous  to  discrete  conversion  of  the  function,  the  continuous  function  must  be  sampled  at 
a  high  enough  frequency  so  as  to  meet  the  Nyquist  rate.  Microscanning  increases  this  sampling 
rate  and  allows  the  continuous  functions  to  be  properly  mapped  and  used  to  form  the  discrete 
Wiener  filter  for  restoration. 

A  considerable  understanding  of  the  advantages  and  limitation  of  the  Wiener  filtering  technique 
was  acquired  from  the  information  compiled  during  the  microscan  simulation.  The  knowledge 
acquired  from  the  simulation  was  applied  to  restoring  imagery  collected  from  a  real-time  FT.TR 
imaging  system.  From  the  simulation  and  experimental  results,  it  is  evident  that  the  proposed  k- 
nearest-neighbor  approach  for  estimating  the  missing  frames  significantly  improves  the  image 
quality.  The  resolution  of  the  final  high  resolution  image  can  be  improved  further  by  the 
application  of  Wiener  filtering,  especially  for  alleviating  the  effects  of  blurring  caused  by  system 
MTF. 

2  The  Sampling  Process 
2.1  Sampling  in  a  FLIR  Array 

In  a  FLIR  array,  sampling  is  performed  by  a  finite  sized  array  of  detectors  and  three  main  factors 
must  be  taken  into  consideration:  the  optical  point  spread  function,  the  detector  charge 
integration,  and  the  detector  array  geometry.  The  block  diagram  in  Fig.  1  illustrates  the  sampling 
process  in  a  FLIR  array.  The  object  scene,  denoted  o(x,y),  is  convolved  with  the  point  spread 
function  of  the  optics,  psf(x,y),  and  the  aperture  function  for  the  square  detector6,  d(x,y). 


Finite  Detector  Sampling 
Array  Function 

Figure  1.  Block  diagram  of  the  imaging  system. 


For  our  imaging  system,  we  assume  the  detector  has  a  flat  response  across  its  active  region  so  the 
detector  function  can  be  expressed  as: 


d(x,y) 


|  ab\reCt 


A 

U  'b)' 


(1) 


where  a  and  b  are  the  dimensions  of  the  individual  detectors.  The  result  is  multiplied  by  a 
function  representing  the  limited  extent  of  the  detector  array,  r(x,y),  expressed  as 
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r(x,y )  =  red 


x_  y_ 
X’Y 


where  X  and  Y  are  the  dimensions  of  the  array.  To  apply  this  integration  to  all  of  the  detectors,  it 
is  multiplied  by  the  sampling  lattice,  combAx.Ay(x,y), 

comb^lx.y)  =T7T-  I  ~m,  j~-n  =IIS  (x-m&x.y-n&y)  (3) 

AX  Ay  m=-~  n  =  —  V  A/  J  m=-~n=-~ 

where  Ax  and  Ay  are  the  center-to-center  detector  spacings.  The  resulting  expression  for  the 
staring  image  is 

i(x,y)=  o(x,y)*psf(x,y)*  'rect(~t’  comb^>(x’ y)  ■  (4) 

An  illustration  of  a  uniform  detector  array  showing  critical  dimensions  is  shown  in  Fig.  2. 


Figure  2.  The  detector  array. 


2.2  Controlled  Microscanning 

In  controlled  microscanning,  the  subpixel  shifts  between  temporal  image  frames  are  controlled 
and  therefore  known  in  advance.  A  level  L  microscan  is  defined  to  be  the  case  where  L  staring 
frames  are  acquired  for  each  high  resolution  frame  and  each  staring  frame  has  its  own  unique 
controlled  subpixel  shift,  each  shift  being  part  of  a  uniform  grid.  Thus,  each  staring  frame  has  a 
shift  that  is  integer  multiple  of  1/L  times  the  detector  width.  The  staring  frames  are  interlaced  in 
an  LxL  pattern  to  produce  a  high  resolution  frame  of  size  NLxNL  where  N  is  the  size  of  the 
square  detector  array.  For  a  level  2  or  2x2  microscanned  image,  the  original  scene  is  stepped  one 
half  the  length  of  the  detector  in  the  x  and  y  directions  producing  a  series  of  four  staring  images. 
These  images  are  then  interlaced  to  produce  the  resulting  microscan  image  of  2Nx2N  pixels,  as 
shown  in  Fig.  3  where  the  reconstructed  image  has  4  times  the  number  of  unique  samples  as  any 
of  the  four  frames.  Thus,  a  level-L  microscanning  effectively  increases  the  sampling  frequency 
by  L  without  changing  the  detector  size  or  spacing.  Since  the  effect  of  the  finite  detector  array 
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r(x,y)  is  small  compared  to  the  effects  of  the  detector  integration  and  the  psf,  neglecting  the  effect 
of  r(x,y),  an  LxL  microscan  process  can  be  expressed  as 


i(x,  y)  =  o(x,y)*  psf  (x,  y)*  j^rect 


L-l  L- 1  /  j  j 

X  X  COmbA  ,.AV  X  ~  T  ’y  ~  T 


•-  i  =  0  ;  =  0 


where  the  factor  of  1/L2  is  used  to  adjust  for  the  reduction  in  detector  integration  time  at  each  of 
the  L2  microscan  steps. 


Detector  Size 


Figure  3.  Illustration  of  a  level  2  microscanning  process. 


2.3  Uncontrolled  Microscanning 


Uncontrolled  microscanning  results  from  practical  applications  where  the  imager  is  mounted  on  a 
moving  and/or  vibrating  platform,  such  as  an  aircraft,  and  the  vibrations  associated  with  the 
platform  can  be  exploited  to  create  the  shifts  in  the  acquired  frames.  In  uncontrolled 
microscanning,  the  shifts  for  each  recorded  frame  are  unknown  and  must  be  obtained  before 
forming  an  estimate  of  the  high  resolution  image.  Because  the  shifts  are  random  in  uncontrolled 
microscanning,  it  eliminates  the  need  for  a  microscan  mirror  and  driver  system  otherwise 
required  in  controlled  microscanning.  Since  a  fixed  number  of  frames  is  required  for  a  given 
level  of  microscanning,  and  the  shifts  are  random,  some  of  the  acquired  frames  may  have  almost 
similar  shifts  or  unusable  shifts  which  will  generate  empty  bins  in  the  high  resolution  image 
reconstruction.  Thus,  the  key  factors  for  the  high  resolution  image  reconstruction  are  the  accurate 
knowledge  of  the  subpixel  translational  motion  of  the  scene  relative  to  the  FLIR  array  and  the 
accurate  estimation  of  the  missing  frames  for  filling  the  empty  bins  in  the  high  resolution  image. 
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3  Image  Registration 


For  uncontrolled  microscanning,  the  movement  of  the  scene  on  the  array  is  produced  by  random 
motion  and/or  vibration  of  the  imager  platform.  Therefore  the  image  shifts  are  unknown  for  each 
recoreded  frame.  It  is  necessary  to  develop  a  suitable  algorithm  which  can  accurately  estimate  the 
registration  parameters  i.e.,  in  our  case,  the  random  translational  shifts.  Various  algorithms  have 
been  reported  in  the  literature  for  estimating  the  image  registration  parameters2'7'8.  Among  these 
techniques,  the  gradient  based  registration  technique  proposed  in  Ref.  2  appears  to  be  particularly 
attractive  for  practical  applications.  However,  this  technique  only  works  for  small  shifts.  To 
accommodate  the  case  where  larger  shift  values  are  anticipated,  we  utilized  the  iterative 
technique  proposed  in  Ref.  7  and  incorporated  it  with  the  registration  algorithm  proposed  in  Ref. 
1.  In  this  algorithm,  the  first  acquired  frame  is  considered  to  be  the  reference  frame  and  the  shifts 
associated  with  the  remaining  frames  are  calculated  with  respect  to  the  reference  frame.  If  p 
represents  the  total  number  of  frames  acquired  by  the  imager,  and  hk  and  vk  represents  the  shifts 
in  the  horizontal  and  vertical  directions  for  the  kth  frame,  then  the  observed  kth  frame  may  be 
expressed  as 


ok(x,y)=ol(x  +  hk,y  +  vk),  (6) 

where  k  €  { 2 ,  3,  4,  ...,  p}  and  for  the  reference  frame,  hj  =  v}  =  0.  Considering  the  first  three 
terms  for  the  Taylor  series  expansion,  Eq.  (6)  may  be  approximated  as 


,  ,  dox(x,y)  dox(x,y) 

ok(x,y )  ~ox(x,y)  +  hk - —  +  vk 


(7) 


Since  Eq.  (7)  is  an  approximation  itself  and  discrete  estimates  of  x  and  y  must  be  used,  it  is  useful 
to  apply  the  method  of  least  squares  for  solving  the  registration  parameters  hk  and  vk.  For  the  least 
squares  solution9  the  error  expression 
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must  be  minimized,  where  m  and  n  are  discrete  variables  for  the  x  and  y  directions,  M  and  N 
represents  the  total  number  of  pixels  in  the  x  and  y  directions,  and  hk  and  v*  are  the  translational 
shifts  in  the  x  and  y  directions  between  the  kth  frame  and  reference  frame.  Equation  (8)  can  be 
minimized  by  differentiating  Ek(hk,vk)  with  respect  to  hk  and  vk  and  setting  the  derivatives  equal 
to  zero.  This  yields  two  equations  which  must  be  solved  simultaneously  and  can  be  conveniently 
represented  in  the  following  matrix  form: 
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(9) 
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Equation  (9)  can  be  represented  in  a  short  form  as 

M  -S  =V 

where  S  =  [hk  Vk  ]r. 
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(10) 


Therefore,  the  estimated  registration  parameters  can  be  calculated  as 

S  =  M  "V.  (11) 

To  incorporate  the  case  involving  larger  shift  values,  we  follow  the  iterative  technique  of  Ref.  7. 
At  first,  the  initial  registration  parameters  are  estimated  using  Eq.  (11).  Using  these  estimated 
shift  parameters,  Ok(m,n)  is  shifted  so  as  to  closely  match  oj(m,n).  The  resulting  image  is  then 
registered  to  o\(m,n).  Using  the  above  mentioned  procedure,  oAm,n)  is  continuously  modified 
until  the  registration  estimates  become  sufficiently  small.  The  final  registration  estimate  is 
obtained  by  summing  all  of  these  partial  estimates. 

4  High  Resolution  Image  Reconstruction 

4.1  Missing  Frame  Estimation 

We  define  a  missing  frame  as  the  frame  whose  translational  shifts  are  similar  to  another  frame  or 
a  frame  which  does  not  exist  within  a  desired  grid  pattern.  Since  the  translational  shifts  are 
random,  some  of  the  required  frames  for  a  given  level  of  microscanning  may  not  exist  for  the 
high  resolution  image  reconstruction.  For  instance,  let  us  assume  that  frame  3  of  the  2x2 
microscanning  example  shown  in  Fig.  3  is  missing.  The  high  resolution  image  reconstructed 
from  the  three  available  frames  (1,2  and  4)  is  shown  in  Fig.  4  where  the  empty  spots  corresponds 
to  the  missing  pixels  of  frame  3.  Therefore,  such  missing  frames  must  be  estimated  before 
attempting  the  high  resolution  image  reconstruction.  Since  the  technique  proposed  herein  is 
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intended  for  real  time  applications,  we  must  devise  a  computationally  inexpensive  algorithm  for 
estimating  the  missing  frames.  Accordingly,  we  have  used  a  ^-nearest-neighbor  (k  =  8)  approach 
for  estimating  the  missing  frame  pixels.  In  this  approach,  a  missing  frame  pixel  (i,j)  in  the  high 
resolution  image/*  may  be  obtained  as  the  average  of  its  neighbors,  and  can  be  expressed  as 

~  X  X  fh(i  +  m,j  +  n)  (12) 

m=- 1  n=- 1 

where  m  and  n  can  not  be  zero  simultaneously. 


1 _ .  2 


4 _ * 

Microscan 

Pattern 


Figure  4.  Level  2  microscanning  process  of  Fig.  3  with  frame  3  missing. 

4.2  Blurring  Effects 

Let  us  now  focus  our  attention  towards  the  effect  of  image  blurring  and  the  use  of  the  Wiener 
filter  as  a  means  of  image  restoration.  For  most  FLIR  imaging  systems,  there  are  two  main 
sources  of  image  blurring  effects  -  the  system  optics  and  the  finite  detector  size.  Since  the 
imager  optics  and  detector  parameters  are  known,  one  can  accurately  model  the  system  MTF, 
which  is  a  good  representation  of  the  overall  system  blurring  function10.  To  deconvolve  the  effect 
of  blurring,  one  can  design  a  function  using  the  system  MTF  which,  when  multiplied  (convolved 
in  the  spatial  domain)  with  the  Fourier  transform  of  the  degraded  image,  will  eliminate  the 
system  MTF  and  therefore  the  effects  of  blurring.  This  is  precisely  what  is  accomplished  by  the 
application  of  inverse  filtering  techniques  such  as  Wiener  filtering. 

4.3  Wiener  Filtering 

It  is  possible  to  improve  the  image  restoration  with  respect  to  the  noise  by  incorporating  some  a 
priori  knowledge  of  the  statistical  makeup  of  the  noise.  This  is  accomplished  by  an  improved 
inverse  filtering  technique  known  as  Wiener  filtering.  The  Wiener  filter  is  a  linear  filter  which  is 
designed  to  minimize  the  mean  squared  error  between  a  given  image  and  a  desired  image. 
Wiener  filters  are  mathematically  similar  to  traditional  inverse  filters  with  the  addition  of  an 
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expression  which  attempts  to  minimize  noise  amplification  by  employing  the  power  spectra  of 
the  original  image  o(x,y),  and  of  the  noise  n(x,y).  The  continuous  Wiener  filter9,  Hcw(u,v)  is 
represented  by  the  following: 

(u,v) _ 

nn  (^»V)  /  S ff  (M,  V 


Hcw{u,v)  = 


H 


ltf(u,v)l2+y[s 


where  Snn(u,v)  and  Sjrfu.v)  are  the  power  spectral  densities  of  the  noise  and  original  image,  H(u,v) 
is  the  Fourier  transform  of  the  system  point  spread  function,  and  y  is  a  statistical  design  constant 
usually  set  equal  to  1.  Since  the  original  image  is  unavailable  and  it  is  difficult  to  characterize 
the  noise,  Snn( u,v)  and  Sg(u,v)  are  unknown,  Hcw(u,v)  can  be  approximated  by 


Hcw(u,v) 


H\u,v) 
\H(u,vf+  nsr 


(14) 


where  nsr  =  Snn(u,v)/S^(u,v)  is  a  constant  representing  the  noise-to-signal  ratio  and  nsr  e  [0,  l]7. 


The  continuous  optics  and  detector  MTFs  are  shown  Figs.  5a  and  5b  while  the  continuous  system 
MTF  is  shown  in  Fig.  5c.  From  Fig  5a,  it  is  evident  that  the  optics  cut  off  at  83.33cyc/mm  and 
form  Fig.  5b  we  observe  that  the  first  zero  of  the  detector  sine  occurs  at  25cyc/mm.  Figure  5d 
shows  the  continuous  Wiener  Filter  obtained  from  Eq.  (14)  using  the  continuous  system  MTF 
shown  in  Fig  5c  where  the  parameter  nsr  is  set  to  0.01.  Notice  that  the  optical  and  detector  blurs 
are  both  continuous  functions  acting  on  a  continuous  scene.  However,  the  output  of  the  FI.TR  is  a 
discrete  image,  and  image  restoration  is  done  in  the  discrete  domain.  Therefore,  a  discrete 
Wiener  filter  that  works  like  a  continuous  filter  must  be  used  to  restore  the  image.  The  effective 
discrete  filter  must  have  the  same  frequency  response  as  the  continuous  filter  shown  in  Fig.  5c. 
To  successfully  map  a  continuous  function  to  discrete,  the  continuous  function  must  be  sampled 
at  a  high  enough  frequency  so  as  to  avoid  aliasing  in  the  continuous  to  discrete  conversion  of  the 
function".  The  mapping  of  the  continuous  Wiener  filter,  Hcw(u,v),  to  a  discrete  Wiener  filter, 
HdW(mi,ni)  is  given  by12: 


Hdw(ml,nl)  =  H*(u,v),  where 


Lm] 

2  k Ax  ’ 


L«, 

2ntsy  ’ 


and  0  <  m,  ,  n,  <  k 


(15) 


For  HdW(mi,ni)  to  operate  like  Hcw(u,v)  as  a  linear  system,  the  spatial  sampling  interval  Ax/L 
must  be  selected  such  that: 

H'(u,v)  =  0,  where  |m|  ,  |v|  >  .  (16) 

2Ax  2  Ay 

The  above  condition  truncates  the  continuous  Wiener  filter  outside  the  foldover  frequencies  and 
prevents  aliasing  in  the  discrete  version  of  the  filter.  The  sampling  interval  is  dictated  by  the 
detector  spacing  and  microscan  level.  Increasing  the  microscan  level  will  reduce  the  sampling 
interval  and  allow  the  condition  of  Eq.  (16)  to  be  met.  If  the  cutoff  of  the  continuous  system 
MTF  is  known,  the  required  microscan  level  can  be  calculated  to  meet  this  condition.  In  the 
present  case  where  the  MTF  cuts  off  at  83.33  cycles/mm,  and  Ax  =  Ay  =  0.05  mm,  a  microscan 
level  greater  than  8  will  eliminate  aliasing  in  the  discrete  Wiener  filter.  Figure  6  shows  plots  of 
the  discrete  system  MTFs  for  various  microscan  levels.  These  plots  show  that  at  lower  microscan 
levels  the  discrete  system  MTFs  suffer  from  aliasing  and  are  cutoff  by  the  mapping  process  and 
therefore  contain  only  the  information  in  the  central  region  of  the  continuous  system  MTF. 
Notice  that  the  discrete  system  MTF  resembles  the  continuous  system  MTF  shown  in  Fig.  5c 
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only  when  the  microscan  level  approaches  8.  The  discrete  system  MTFs  of  Fig.  6  are  used  in  Eq. 
(14)  to  generate  the  discrete  Wiener  filters  shown  in  Fig.  7.  Microscanning  at  a  level  of  8  or 
higher  allows  us  to  use  a  discrete  Wiener  filter  (Fig.  7c)  which  resembles  the  continuous  Wiener 
filter  shown  in  Fig  5d.  Although  lower  microscan  levels  produce  an  aliased  filter  for  restoration, 
the  aliasing  can  be  treated  as  noise  and  the  Wiener  filter  nsr  parameter  can  be  adjusted  to  achieve 
limited  deblurring. 


Figure  5  a.  Continuous  Optical  MTF. 


Figure  5b.  Continuous  Detector  MTF. 


Figure  5c.  Continuous  System  MTF.  Figure  5d.  Continuous  Wiener  Filter. 

5  Computer  Simulation  Results 

A  MATLAB  program  was  written  to  simulate  the  microscanning  process  under  various 
conditions  such  as  detector  pitch,  shape,  and  size  of  the  active  region  as  well  as  other  parameters 
such  as  lens  configuration.  At  first,  we  investigated  the  effects  of  aliasing  at  various  microscan 
levels  without  using  Wiener  filtering.  A  512x512  image  taken  from  a  digitized  photograph, 
shown  in  Fig.  8a,  was  used  as  the  original  input  scene  to  be  blurred  by  the  system  MTF,  have 
noise  added,  and  be  sampled  by  the  detector  array.  Microscanning  was  then  performed  at  various 
levels  and  under  a  variety  of  detector  and  lens  configurations.  Each  detector  in  the  array  was 
simulated  with  a  8x8  kernel  where  each  kernel  returns  the  average  of  all  pixels  from  the  input 
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image  which  fall  upon  it.  Figure  8b  shows  the  highly  undersampled  image  created  by  sampling 
in  the  staring  mode.  The  translational  shifts  of  all  frames  were  calculated  relative  to  the  first 
frame  (i.e.,  the  reference  frame)  using  the  image  registration  algorithm  discussed  in  Section  3. 
For  instance,  the  theoretical  shift  values  associated  with  each  of  the  sixteen  frames  for  4x4 
microscanning  are  shown  in  Fig.  9.  The  same  shift  parameters  are  then  estimated  using  the 
registration  algorithm  developed  in  Section  3  which  is  also  depicted  in  Fig.  9.  It  is  obvious  that 
the  error  between  the  theoretical  and  estimated  shift  values  are  almost  negligible.  The  results 
from  2x2,  4x4,  and  8x8  microscanning  are  shown  in  Fig.  10.  For  each  case  of  microscanning, 
one  out  of  every  four  frames  were  considered  to  be  missing  which  results  in  the  empty  bins 
shown  in  Fig.  10.  The  missing  frames  were  then  estimated  using  the  8 -nearest-neighbor  approach 
discussed  in  Section  4.1.  The  estimated  frames  were  used  to  fill-in  the  empty  bins  of  Fig.  10  and 
the  resulting  images  are  shown  in  Fig.  11.  From  Fig.  11,  it  is  obvious  that  there  is  significant 
improvement  in  the  high  resolution  image  quality  after  the  inclusion  of  estimated  missing  frames. 
Furthermore,  there  is  substantial  improvement  in  resolution  between  the  2x2  microscan  image 
and  the  images  created  using  higher  microscanning  levels.  Figure  11c  shows  that  although 
aliasing  may  have  been  eliminated  from  the  8x8  microscanned  image,  this  image  is  still  blurred 
by  the  system  MTF.  To  eliminate  the  blurring  caused  by  system  MTF,  Wiener  filtering  operation 


Figure  8a.  Figure  8b. 


Figure  8.  Microscan  simulation:  (a)  original  scene,  and  (b)  staring  frame  sampled  by 

simulated  detector  array. 

was  applied  to  the  aforementioned  microscan  images.  It  was  assumed  that  the  power  spectral 
density  for  both  the  original  image  and  the  noise  were  unknown  and  the  Wiener  filter  was 
developed  using  Eq.  (14).  Figure  12  shows  the  result  of  applying  Wiener  filtering  to  the 
microscan  images  in  Fig.  11.  It  is  evident  that  with  the  increase  in  microscan  levels,  Wiener 
filtering  further  improves  the  image  resolution.  Comparing  Figs.  1 1c  and  8a,  we  observe  that  the 
Wiener  filtered  8x8  microscan  image  is  not  exactly  identical  to  the  original  image.  This  is  due  to 
the  fact  that  the  system  MTF  optically  band-limits  the  scene  at  8.33  cycles  per  milliradian  and 
truncates  all  frequency  information  beyond  that  point.  Frequency  information  from  the  original 
image  beyond  the  cutoff  can  not  be  recovered.  This  explains  why  the  Wiener  filtered  8x8 
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microscan  image  is  not  exactly  the  same  as  the  original  input  image.  Therefore,  the  application  of 
higher  levels  of  microscanning  may  not  yield  better  results. 


x  -  Theoretical  o  -  Shifts  Associated  with  Simulated  Frames 


Horizontal  shift  h_k 

Fig.  9.  Theoretical  and  estimated  registration  parameters  for  16  simulated  frames. 

6  Experimental  Results 

For  experimental  verification  of  the  proposed  algorithm,  we  applied  the  multiframe 
reconstruction  and  Wiener  filtering  technique  to  data  collected  on  a  real-time  FLIR  imaging 
system.  The  FLIR  camera  uses  a  128x128  Amber  AE-4128  infrared  focal  plane  array  with  a  100 
millimeter  focal  length  f/3  lens.  The  focal  plane  array  uses  Indium-Antimonide  (InSb)  detectors 
with  a  response  in  the  3  to  5  micron  wavelength  band.  The  100  millimeter  lens  was  diffraction 
limited  producing  an  optical  cutoff  of  8.33  cycles  per  milliradian.  The  system  uses  square 
detectors  of  size  50  microns  with  an  80%  fill  factor,  making  the  active  detector  size  of  40 
microns  with  the  remaining  10  microns  being  inactive.  The  information  about  the  detector  and 
lens  configuration  was  used  in  our  estimation  of  the  system  MTF,  which  was  used  for  designing 
the  Wiener  filter  during  the  computer  simulation.  The  FLIR  image  data  collected  at  Wright 
Laboratory  using  the  Amber  imaging  system  is  shown  in  Figure  13.  Three  primary  targets  at  a 
range  of  approximately  0.9  kilometers  are  contained  in  the  image:  a  military  truck,  a  civilian 
vehicle,  and  an  M-60  tank.  Figure  13a  is  one  of  the  16  staring  image  frames  collected  for  a  4x4 
microscan  high  resolution  image.  The  staring  image  was  then  pixel  replicated  to  the  same  size  as 
the  microscan  image  for  comparison.  The  4x4  microscan  image  is  shown  in  Fig.  13b  where  the 
balnk  horizontal  lines  shows  that  four  (frames  13,  14,  15  and  16)  out  of  the  total  of  sixteen 
frames  recorded  have  shifts  similar  to  other  frames  i.e.,  four  frames  are  missing.  The  missing 
frames  were  estimated  using  the  procedure  discussed  earlier  in  Sections  4.1  and  5.  The  empty 
bins  of  Fig.  13b  were  filled  with  these  estimated  frames  and  the  resulting  microscan  image  is 
shown  in  Fig.  13c.  The  4x4  microscan  image  after  the  application  of  Wiener  filtering  is  shown  in 
Fig.  13d.  Notice  that  the  application  of  Wiener  filtering  further  increased  the  resolution  of  the 
4x4  microscan  image.  The  reason  for  this  improvement  is  that  by  using  Wiener  filtering  one 
reduces  the  blurring  of  the  MTF  which  is  responsible  for  suppressing  the  increase  in  image 
resolution  that  should  be  obtained  at  higher  microscan  levels.  There  appears  to  be  slight  ringing 
in  the  4x4  microscanned  image  after  the  application  of  Wiener  filter  (Fig.  13d).  This  ringing  is 
most  noticeable  in  the  boundary  and  in  regions  of  high  contrast,  and  is  most  likely  the  result  of 
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phase  distortions  caused  by  the  zero  crossing  values  present  in  the  system  MTF.  Figures  13e  is 
the  same  as  Fig.  13a  but  bilinear  interpolation  was  used  for  the  image  replication.  Comparing  Fig 
13c  with  13d  and  13e,  it  is  evident  that  the  proposed  technique  yields  significantly  improved 
image  resolution  even  without  Wiener  filtering. 

7  Conclusion 

We  have  developed  an  efficient  technique  for  real  time  infrared  image  registration  and  high 
resolution  reconstruction  by  using  multiple  frames  which  are  randomly  shifted  with  respect  to 
one  another  (uncontrolled  microscanning).  Such  translationally  shifted  image  sequences  may  be 
obtained  by  utilizing  the  random  motion  and/or  vibration  of  the  platform,  such  as  an  aircraft,  on 
which  the  imager  is  mounted.  A  registration  algorithm  for  estimating  the  random  translational 
shifts  among  the  multiple  frames  has  been  designed  which  is  found  to  be  inherently  suitable  for 
real  time  implementation.  This  algorithm  can  be  used  for  estimating  both  non-integer  as  well  as 
shift  values  larger  than  the  detector  width.  However,  with  uncontrolled  microscanning,  some  of 
the  frames  acquired  by  the  imager  may  have  shifts  similar  to  other  frames  (missing  frames) 
which  results  in  empty  bins  in  the  high  resolution  image  reconstruction.  This  problem  has  been 
tackled  by  developing  a  fc-nearest  neighbor  approach  for  estimating  the  missing  frames  so  that  it 
can  be  implemented  in  real  time.  Finally,  Wiener  filtering  has  been  successfully  applied  as  an 
effective  means  of  reducing  blurring  in  microscan  images  caused  by  the  system  MTF.  A 
computer  program  which  closely  simulated  the  operation  of  the  microscan  system  has  been 
developed.  The  knowledge  obtained  from  the  microscan  simulation  was  applied  to  real  FLIR 
data  collected  on  the  microscan  imaging  system.  The  results  of  the  application  of  the  Wiener 
filter  to  the  microscan  images  confirmed  the  results  obtained  from  the  simulation  data.  It  was 
found  that  Wiener  filtering  effectively  reduces  the  effects  of  blurring  and  increases  image 
resolution.  This  especially  true  with  the  real  FLIR  data  (compare  Figs.  13c  and  13d). 

Future  Research 

Future  research  should  address  the  following  important  issues: 

1.  In  a  practical  scenario,  the  frames  recorded  by  the  FLIR  array  may  be  rotated  as  well  as 
translated  with  respect  to  each  other.  Therefore,  it  is  essential  to  incorporate  both  the 
rotational  and  translational  registration  parameters  in  the  high  resolution  image 
reconstruction. 

2.  Investigate  the  effects  of  detector  nonuniformity  (gain  and  bias  drift)  which  creates  fixed 
pattern  noise  on  an  image  and  developing  a  suitable  algorithm  to  fix  this  problem. 

3.  Investigate  the  effects  of  both  additive  and  multiplicative  noise  sources  and  develop  a 
suitable  algorithm  to  mitigate  their  effects. 

4.  Improving  the  interpolation  techniques  for  calculating  the  missing  frames  for  real  time 
applications. 

5.  Investigating  the  effects  of  temporal  blurring  caused  by  factors  such  as  atmospheric 
turbulence  and  alleviate  such  problems. 
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Figure  10a. 


Figure  10b. 


Figure  10c. 

Figure  10.  (a)  through  (c)  are  level  2, 4  and  8  simulated 
microscan  images  with  missing  frames. 


Figure  11a. 


Figure  lib. 


Figure  11c. 

Figure  11.  Restored  images  of  Fig.  10  after  estimating 
the  missing  frames. 
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Figure  12.  Wiener  Filtering.  Figs.  1  la,  b,  and  c  are  the 
Wiener  filtered  versions  of  Figs.  10a,  b  and  c. 


Figure  13.  Real  FLIR  Images,  (a)  staring  frame, 

(b)  4x4  microscan  image,  (c)  restored  image. 


Figure  13d. 


Figure  13e. 


Figure  13.  Real  FLIR  Images,  (d)  restored  image  after  Wiener  filtering,  (e)  staring  frame  after  bilinear  interpolation. 


ADDENDUM 


High  resolution  infrared  images  may  also  be  reconstructed  from  multiple  translationally  shifted 
aliased  frames  using  iterative  techniques  such  as  Bayesian  approach  via  maximum  a  posteriori 
(MAP)  algorithm3  or  maximum-likelihood  approach  via  expectation  maximization  (EM) 
algorithm4.  The  MAP  algorithm  utilizes  a  prior  statistical  model  for  estimating  the  translational 
shifts.  With  this  model,  a  maximum  a  posteriori  estimator  is  used  to  estimate  the  high  resolution 
image  and  the  intraframe  translational  shifts.  The  shifts  are  iteratively  updated  along  with  the 
high  resolution  image  using  a  suitable  optimization  technique  such  as  gradient  descent 
optimization.  In  the  maximum-likelihood  approach,  the  data  are  considered  Poisson  random 
variables  and  an  EM  algorithm  is  constructed  that  iteratively  estimates  an  unaliased  image 
compensated  for  the  known  imager  system  blur  while  simultaneously  estimating  the  translational 
shifts.  Both  of  these  algorithms  have  been  found  to  yield  high  resolution  images  from  a  sequence 
of  randomly  shifted  frames.  However,  each  of  these  algorithms  require  significant  computation 
time  and  can  not  be  implemented  for  real  time  applications  using  the  currently  available  high 
performance  microprocessors.  The  EM  algorithm  has  been  found  to  be  faster  than  the  MAP 
algorithm. 

In  this  work,  we  have  used  the  registration  algorithm  discussed  earlier  in  Section  3  to  estimate 
the  translational  shift  parameters  in  just  one  step.  Using  this  shift  information,  we  have  applied  a 
simplified  version  of  the  EM  algorithm  to  estimate  the  high  resolution  image  from  the  given 
sequence  of  frames.  The  original  EM  approach  recalculates  the  shits  parameters  with  each 
iteration  of  the  algorithm.  The  new  image  shift  estimates  are  calulated  by  evaluating  a  cost 
function  that  compares  the  shifted  and  interlaced  data  frames  to  the  corresponding  values  in  the 
algorithm’s  latest  high  resolution  image  estimate.  The  cost  function  is  the  sum  over  all  the  pixels 
in  the  high  resolution  gird  of  the  absolute  value  of  the  interlaced  data  frames  divided  by  the  high 
resolution  image  estimate.  The  shift  parameters  provided  by  the  new  registration  algorithm  are 
accurate  enough  to  eliminate  the  need  for  iterative  calculation  of  the  translational  shifts. 

The  proposed  modified  EM  algorithm  has  been  found  to  drastically  reduce  the  computation 
burden  compared  to  the  original  EM  algorithm  thereby  making  it  more  attractive  for  practical 
implementation.  To  verify  the  performance  of  the  modified  EM  algorithm,  let  us  consider  the  16 
staring  FLIR  real  image  data  frames  used  in  Section  6.  One  of  these  16  staring  FT  JR  image 
frames  is  shown  in  Fig.  13a.  Using  these  16  frames,  the  output  obtained  by  the  original  EM 
algorithm4  and  that  obtained  by  the  modified  EM  algorithm  after  ten  iterations  are  shown  in 
Figures  14a  and  14b  respectively.  Comparing  Figs.  14a  with  14b,  we  observe  that  the  modified 
EM  algorithm  yields  high  quality  image  and  can  be  considered  as  a  viable  alternative  to  the 
original  EM  algorithm.  For  page  number  limitation,  the  author  could  not  discuss  this  approach  in 
detail  in  this  report. 
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Abstract 

During  the  summer  of  1996  a  program  was  initiated  by  the  authors  to  study  the 
characteristics  of  pilot  induced  oscillations  (PIOs).  The  long  term  goals  are 

•  to  develop  a  methodology  to  take  time  history  data  from  flight  or  ground  based 
simulation  and  determine  if  a  vehicle  is  PIO  prone, 

•  to  determine  if  a  PIO  actually  occurred  in  a  time  history  record, 

•  to  develop  a  non-real-time  analysis  tool  to  determine  from  flight  simulator  time 
history  data  if  PIOs  occurred  so  that  simulation  engineers  can  help  insure 
consistency  between  pilot  comments,  Cooper-Harper  ratings,  PIO  ratings  and  time 
history  data,  and 

•  to  develop  a  real-time  capability  of  detecting  a  PIO  fast  enough  to  take  action  to 
prevent  the  full  development  of  the  PIO. 

Results  obtained  over  the  period  7/1/96-8/23/96  deal  primarily  with  the  first  goal 
and  are  described  in  this  report.  A  computer  analysis  tool  is  developed  in  which  fast  fourier 
transforms  are  used  to  determine 

•  resonant  frequency  and  output  phase  angles  at  the  resonant  frequency, 

•  predictions  of  PIO  susceptibility  using  the  Smith-Geddes  PIO  criteria. 

The  following  preliminary  results  are  highly  encouraging. 

•  13  of  14  configurations  which  experienced  PIOs  were  correctly  identified. 

•  In  12  additional  configurations  6  were  correctly  identified  as  being  not  PIO  prone. 
Results  for  the  remaining  6  configurations  were  complicated  by  multiple  resonances 
that  confuse  the  resonance  detector.  However,  depending  on  the  choice  of  resonant 
frequency  the  correct  classification  is  made  for  each  of  the  6  configurations. 

•  The  occurrence  of  multiple  resonances  is  closely  associated  with  configurations  that 
are  not  PIO  prone. 

•  continued  development  of  the  resonance  detector  is  planned. 
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A  FAST  FOURIER  TRANSFORM  ANALYSIS 
OF  PILOT  INDUCED  OSCILLATIONS 

Dominick  Andrisani,  II  and  Sten  E.  Berge 


Background 

Pilot  induced  oscillations  (PIOs)  have  received  much  recent  attention  [2-6].  Several 
approaches  to  development  of  a  unified  PIO  theory  are  currently  being  pursued.  A  catalog  of 
PIO  events  has  been  compiled  and  several  existing  criteria  have  been  shown  to  successfully 
predict  PIO  susceptibility.  Accurate  analysis  of  the  ever-growing  PIO  database  will  gather 
insight  into  the  characteristics  of  PIOs.  With  this  information,  a  PIO  detector  can  be 
designed  for  time  history  data  as  generated  in  either  flight  or  simulation. 


Discretization  of  Time  History  Data 

Most  PIO  time  history  data  in  the  catalog  of  PIO  events  is  in  paper  copy  form;  the 
original  flight  data  having  been  lost  or  unavailable  for  various  reasons.  As  a  first  step  in 
this  analysis,  various  time  history  records  from  the  PIO  database  were  digitized.  Eleven 
classic  PIOs  and  eighteen  Bjorkman  (NT-33)  time  histories  [7]  were  digitized.  The  eleven 
classic  PIOs  were  taken  from  the  PIO  catalog  compiled  by  HAI  in  cooperation  with  STI[2]. 
There  is  one  roll  ratchet,  several  rate  limited  actuator  cases,  mode  switches,  and  both 
lateral  and  longitudinal  examples.  The  Bjorkman  time  histories  are  all  longitudinal,  but 
provide  a  consistent  database  encompassing  nearly  the  entire  range  of  the  PIO  tendency 
classification  scale.  The  digitization  was  hampered  by  the  quality  of  the  reproductions  and 
the  scales  of  the  plots.  Sections  of  data  were  often  missing  because  of  repeated  previous 
reproduction.  Digitization  noise  was  introduced  into  the  data  axis  scaling  was  not  well 
chosen.  For  example,  the  B-2  charts  cover  too  much  time,  and  the  non-PIO  Bjorkman 
configurations  are  plotted  with  the  same  scale  as  the  large  amplitude  PIO  histories. 
Conversely,  there  were  several  data  sets  with  time  scaling  that  covered  too  small  a  time 
period  and  which  cut  off  the  peaks  of  the  data.  The  M2F2  charts  are  examples  of  this  other 
extreme. 

The  following  multi-step  digitization  process  was  employed. 

•  PIO  time  histories  were  scanned  and  cleaned  up  using  OFOTO  commercial 

software. 

•  PIO  time  histories  were  digitized  the  data  using  dataThief  shareware. 

•  The  resulting  ASCII  data  file  was  converted  to  MATLAB  script  format. 
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•  A  MAT  LAB  script  was  written  to  produce  a  second  MATLAB  script  file  with  a 

specified  uniform  time  increment 

*  A  MATLAB  script  was  written  to  plot  the  time  history  data  in  a  similar  format  to 

the  original. 

In  all  27  longitudinal  records  (flight  records  from  the  T-38,  Shuttle,  B-2,  F-8,  YF-22, 
and  X-15,  18  Bjorkman  NT-33  flight  records,  and  3  HAVE  PIO  ground  based  simulation 
records)  and  5  lateral  records  (flight  records  from  the  F/A-18(2),  F-14,  YF-16,  and  M2-F2) 
were  digitized.  These  records  are  available  at  WL/FIGC-2  (Mr.  David  Leggett  is  the  point  of 
contact,  513-255-8498). 


Technical  Accroach 

A  spectral  analysis  is  performed  on  selected  time  histories  to  determine  the 
quantities  needed  for  PIO  characterization.  The  resonance  of  each  measurement  is 
determined  from  power  spectral  density  for  that  variable.  The  actual  resonant  frequency 
will  differ  from  the  discrete  frequencies  used  in  the  power  spectral  density  plot,  unless  the 
resonance  happens  to  correspond  to  one  of  the  harmonics  of  the  spectral  analysis.  Therefore, 
the  resonant  frequency  is  approximated  by  finding  the  centroid  of  the  peak  and  its  two 
nearest  neighbors.  The  cross  spectrum  between  input  and  output  is  used  to  calculate  a 
frequency  response  function  relating  input  and  output.  The  phase  angle  at  the  resonant 
frequency  is  taken  from  the  phase  angle  of  the  frequency  response  function.  The  validity  of 
the  results  is  related  to  the  coherence  of  the  cross  spectral  data  points  used. 

The  Smith-Geddes  criteria  are  then  evaluated.  The  pilot-vehicle  resonant  frequency 
is  chosen  to  be  the  resonant  frequency  of  the  variable  with  the  most  reliable  time  history. 
This  is  most  often  pitch  rate.  The  Smith-Geddes  criteria  require  the  phase  angles  of  both 
8(j©)/F8(j(fl)  and  Nz(j©)/Fs(j©).  The  phase  of  6(j<o)/Fg(jto)  can  also  be  calculated  by 
subtracting  90  degrees  from  the  phase  of  q(jco)/F8(ja)).  The  Smith-Geddes  analysis  is  done 
with  both  the  0  and  q  phase  angles.  Susceptibility  to  PIO  as  defined  in  MIL-STD-1797A 
[8],  approximate  Cooper-Harper  rating,  and  approximate  PIO  rating  based  on  Ralph  Smith's 
data  method  [10]are  outputs  of  this  analysis. 

This  procedure  is  not  be  completely  automatic  at  this  time  for  two  reasons.  First, 
the  FFT  window  size  must  be  chosen  subject  to  several  constraints:  the  primary  (lowest) 
harmonic  (set  by  the  length  of  time  (T window)  used  in  the  FFT,  A®=2x/Twindow)  should  be 
approximately  1  rad/sec,  and  the  highest  harmonic  (set  by  the  time  increment  (AT), 
Q>ma y=ft/AT)  must  be  at  least  20  rad/sec  to  include  the  resonant  frequencies  of  all  possible 
types  of  PIOs  including  pitch  bobbles  and  roll  ratchets,  the  window  not  contain  discrete 
events  such  as  control  system  mode  switches,  and  at  least  two  similar  windows  are  required 
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to  compute  coherence.  Second,  the  best  pilot-vehicle  resonant  frequency  must  be  manually 
chosen  when  multiple  resonances  are  present.  Unless  the  same  dominant  resonant 
frequency  appears  in  all  measurements,  the  correct  choice  is  unclear.  Further  research  on 
this  problem  is  continuing. 

The  spectral  analysis  is  performed  by  MATLAB's  spectrum  function  in  the  Signal 
Processing  Toolbox.  The  procedure  used  is  Welch’s  averaged  periodogram  method  [1,9]. 
Each  frame  consists  of  three  subframes  (two  consecutive  and  one  overlapping  one  half  of  the 
other  two).  Each  subframe  is  preprocessed  by  removing  any  bias  and  is  windowed  with  a 
Hanning  window.  Use  of  more  than  one  subframe  helps  insure  valid  coherence.  The 
software  steps  through  the  time  history  frame  by  frame. 

PIO  susceptibility  is  determined  by  the  Smith-Geddes  criteria  using  the  following 

steps. 

•  Pilot-in-the-Loop  Properties 

Resonant  frequency,  wr. 

Amplitudes  of  inputs  and  outputs  at  wr  from  FFT  data 

©U®.)>  ?U®,)>  «  O'®,) 


Pilot- Vehicle  System  Properties:  Phase  lags  computed  using  Cross  Spectral  densities 
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A  few  comments  are  in  order.  We  evaluate  phase  angles  at  the  resonant  frequency  of 
the  pilot-in-the-loop  system,  wr.  The  Smith-Geddes  criterion  evaluates  the  phase  angles  at 
the  criterion  frequency,  Wc  =  0.24S  +  6  where  S  is  the  average  slope  of  the  I  q(j©)/Fs(jco)  I 
over  the  interval  from  1-6  radians  per  second,  wc  is  an  approximation  to  the  crossover 
frequency  of  the  pilot-vehicle  system  during  pitch  attitude  tracking.  If  there  is  a  dominant 
resonance  in  a  tame  history  record  our  approach  is  reasonable  since  the  pilot-vehicle  system 
will  resonate  near  the  crossover  frequency  of  the  pilot-vehicle  system. 

Our  approach  is  more  than  an  open  loop  method  because  our  resonant  frequency  is  a 
property  of  the  pilot-vehicle  system  as  expressed  in  the  time  histories  from  the  piloted 
system. 


Problems  with  the  Resonant  Frequency  Detector 


Several  problems  were  encountered  with  our  resonant  frequency  detector. 

•  Power  spectra  of  important  traces  sometimes  have  multiple  peaks.  Figure  1 
shows  an  example  of  a  time  spectrum  with  two  resonant  peaks.  Figure  2  shows  the  more 
common  case  where  there  is  a  single  dominant  resonant  peak. 

•  Power  spectra  of  different  traces  sometimes  have  different  peaks  as  shown  in  the 
sketch  below. 


Power 


Power 


Frequency 

•  Low  frequency  power  sometimes  confuses  our  resonant  frequency  detector.  Figure  1 
also  illustrates  this  problem. 

There  are  several  possible  solutions  to  those  problems.  Multiple  peaks  on  one  power 
spectrum  and  peaks  that  don't  line  up  seem  to  correlate  with  the  absence  of  PIO.  Low 
frequency  power  might  be  removed  by  a  different  choice  of  windowing  in  computing  the  FFT 
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or  by  filtering  the  time  history  before  the  FFT  is  taken.  W e  are  currently  using  a  Hanning 
window. 

The  end  of  the  summer  prevented  our  resolution  of  the  problems.  Development  of  the 
resonant  frequency  detector  will  continue  until  the  end  of  1996  by  Professor  Andnsani. 

Results 

The  results  from  our  analysis  of  longitudinal  PIO  data  are  summarized  in  Tables  1 

and  2. 

PIO  amplitudes  versus  resonant  frequency  is  shown  in  Figure  3.  The  longitudinal 
PIOs  in  this  study  occur  at  a  wide  range  of  amplitudes.  This  suggests  amplitude  of  the 
resonance  is  not  a  good  metric  to  determine  if  a  PIO  is  occurring. 

Angle  4>  (of  the  Smith-Geddes  criterion  [10])  versus  actual  average  PIO  rating  for  the 
Bjorkman  data  is  shown  in  Figure  4.  The  line  in  the  figure  comes  from  [10]  and  seems  to 
underestimate  the  average  PIO  ratings.  Average  Cooper-Harper  ratings  versus  angle  of 
©(jcoyFsCjo))  is  shown  in  Figure  5. 

Figure  6  shows  that  the  occurrence  of  a  PIO  seems  to  correlate  well  with  a  single 
dominant  resonant  frequency  in  the  time  history.  This  characteristic  seems  more  helpful  in 
determining  that  a  PIO  has  occurred  than  output  amplitude  at  the  resonant  frequency. 
More  work  is  required  here  to  determine  how  to  effectively  use  this  fact 

Table  1  indicates  that  acceleration  type  PIO  were  important  in  detecting  PIO 
susceptibility  in  only  1  of  24  cases  (B2).  The  need  for  acceleration  type  PIO  is  currently 
under  review  by  the  authors.  These  results  indicate  that  the  Smith-Geddes  boundaries  are 

reasonable. 

When  phase  angles  vary  significantly  during  a  time  history  nonlinear  behavior  is 
suspected.  One  cause  of  this  is  rate  limiting,  where  phase  lag  is  a  function  of  amplitude  and 
frequency.  The  Shuttle  record,  Figure  7,  shows  strong  time  varying  phase  angle  of 
q(joj)/Fs(ja>).  The  B2  record,  Figure  8,  shows  almost  no  time  varying  phase  angle  of 
q(jco)/Fs(j©).  The  ability  of  this  method  to  track  time  varying  phase  angles  is  significant 

because  this  can  trigger  a  PIO. 

Towards  a  Time  Domain  PIO  Detector 

The  following  comments  summarize  the  status  of  our  efforts  at  developing  a  FFT 
Based  PIO  detector. 

•  The  existing  algorithm  is  not  set  up  to  run  in  real  time.  A  Wright  Laboratory  in- 
house  effort  by  Mr.  Phil  McKeehan  is  anticipated  in  this  area. 
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•  Some  operator  intervention  is  helpful  in  selecting  the  time  periods  to  analyze  so 

that  PIO  events  are  not  split  into  different  time  frames. 

•  Improvement  is  necessary  on  the  resonant  frequency  detector. 

•  We  still  need  to  determine  in  what  way  amplitude  at  resonant  frequency  is  helpful 

for  PIO  detection. 

•  We  need  to  decide  what  the  occurrence  of  multiple  resonances  indicates  about  PIO. 

Conclusions 

Based  on  recently  digitized  time  histories  of  27  longitudinal  PIOs  the  following 
conclusions  are  drawn. 

•  Our  FFT  Based  PIO  Analysis  Software  using  the  Smith -Geddes  criteria  does  an  excellent 

job  of  determining  PIO  susceptibility. 

•  Preliminary  results  indicate  that  the  Smith-Geddes  boundaries  are  reasonable. 

•  Acceleration  type  PIO  were  important  in  detecting  PIO  proneness  in  only  1  of  24  cases 

(B2).  The  need  for  this  characterization  is  under  continuing  review. 

•  Continued  development  of  the  resonant  frequency  detector  is  required  for  cases  where 

multiple  resonances  occur. 

•  The  ability  of  this  method  to  track  time  varying  phase  angles  is  significant  because  this 

can  serve  as  a  trigger  for  PIOs. 


Further  Work 

During  Professor  Andrisani's  sabbatical  at  Wright  Laboratories  (8/19/96-12/20/96) 
the  additional  work  on  the  following  items  is  planned. 

•  resonant  frequency  detector, 

•  adequacy  of  the  Smith-Geddes  boundaries, 

•  need  for  acceleration  type  PIOs, 

•  importance  of  multiple  resonances  in  PIO  detection, 

•  automating  the  analysis  process  to  achieve  the  third  goal  mentioned  in  the 

abstract. 

An  in-house  Wright  laboratory  effort  is  anticipated  to  develop  and  implement  a  real 
time  PIO  detector  based  upon  this  work. 
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Table  1.  Summary  of  Numerical  Results  for  Longitudinal  PIOs. 


PHASE  LAG  ANALYSIS 

Long.  P10  from  Right 

Aircraft 

WT 

Theta  Wrdeg 

qWrdeg 

nzWrdeg 

phi 

PIO  in  time  history 

T-38 

7.49 

-268 

-234 

-248 

-355 

Shuttle(t- 24-30) 

2.40 

NaN 

-153 

NaN 

NaN 

ShuttJe(t-26-32) 

3.40 

NaN 

-171 

NaN 

NaN 

B2  (t=25-3 1 .7) 

3.63 

-165 

-8 

-138 

-190 

F8  (t«=8. 1-20.6) 

2.79 

-220 

-144 

-185 

-225 

YF-22 

3.25 

-213 

-121 

-197 

-243 

Bjorkman  2-5 

2.57 

-208 

-125 

-196 

-232 

Bjorkman  2-8 

4.33 

-219 

-126 

-174 

-236 

Bjorkman  3-12 

1.91 

-162 

-72 

-126 

-154 

Bjorkman  3-13 

3.21 

-184 

-130 

-178 

-224 

Bjorkman  5-9 

3.51 

-227 

-122 

-185 

-235 

Bjorkman  5-10 

2.81 

-195 

-119 

-176 

-216 

HAVE,  4/13,#21(Bj3-13) 

1.32 

-277 

-196 

-261 

-279 

HAVE,  4/13,#22(Bj3-13) 

-195 

-271 

-287 

HAVE,  4/13,#23(Bj3-13) 

1.76 

-320 

-236 

-322 

-348 

no  PIO  in  time  history 

Bjorkman  2-B 

4.95 

-123 

-74 

-137 

-207 

Bjorkman  2-1(wr=1.1) 

1.12 

-70 

37 

-93 

-109 

(wr-5.4) 

5.43 

-266 

-97 

-178 

-256 

Bjorkman  3-D 

6.88 

-180 

-89 

-32 

-130 

Bjorkman  3-1 

1.78 

-67 

14 

-67 

-93 

Bjorkman  3-3 

3.51 

-119 

-47 

-114 

-165 

Bjorkman  3-6  (wr-8.9) 

8.87 

-203 

-108 

-36 

-163 

3.00 

-105 

-13 

-50 

-93 

bJ  Ll.U.Ad 

5.97 

-174 

-94 

-141 

-227 

(wr-10) 

10.00 

-277 

-122 

-340 

wmml 

(wr-2.5) 

2.52 

-120 

-8 

-92 

-128 

Bjorkman  4-2 

4.83 

-155 

-65 

-129 

-198 

Bjorkman  5-1 

2.37 

-100 

-35 

-100 

-133 

maybe  PIO  in  time  history 

Bjorkman  2-7(t«  14-29) 

4.29 

-197 

-119 

-138 

-200 

(t=22-29,  wr-4.3) 

4.28 

-194 

-119 

-144 

-205 

(t=22-29,  wr«6.1) 

6.12 

-241 

-149 

-120 

-208 

Bjorkman  3-8 

7.03 

-246 

-155 

-117 

-217 

Bjorkman  5-1 1  (wr«2.5) 

2.50 

-202 

-113 

-142 

-178 

(wr-1.9) 

1.87 

-147 

-56 

-108 

-135 

(wr-3.2) 

3.32 

-229 

-137 

-145 

-193 
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Table  1,  concluded 


Overall  Results 


Long.  PIO  from  Flight 


Flight  Ratings 


Aircraft _ _ _ I  Act.  PtOR 

PIO  in  time  history 


T-38 


Shuttle(t«24-30) _ 


Shuttle(t~26-32) _ 


B2  (t=25-3V7) _ 


F8  (t=8. 1-20.6) _ 


YF-22  _ 


Bjorkman  2-5 


Bjorkman  2-8  _ 


Bjorkman  3-12 _ 


Bjorkman  3-13 _ 


Bjorkman  5-9 _ 


Bjorkman  5-10 


HAVE,  4/13,#21(Bj3-13) 


HAVE,  4/13,#22(Bj3-13) 


HAVE,  4/l3.#23(Bj3-13) 


no  PIO  in  time  history 


FFT  Classification 


Sus_PI0  Correct 


4.33(4,4,5) 


4(4, 4, 4) 


4.5(4, 5) 


4.5(4, 5) 


4(4,4) 


5(5,5) 


Bjorkman  2-B _ 


Bjorkman  2-1(wr»1.1) 

(wr-5.4) 


iorkman  3-D 


Bjorkman  3-3  _ 


Bjorkman  3-6  (wr-8.9) _ 


(wr~3.0) 


iorkman  4-1  (wr-6) 


2(3,2, 2,1) 


10,1,1) 


10,1) 

1 

2.33(3,2,2) 

2 

1.68(3,1,1) 

1 

2(2,2) 

2 
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Workman  5-1 


1.33(1,1,2) 


10.1) 


maybe  PIO  in  time  history 


Bjorkman  2-7 (t- 14-29) 


(t=22-29,  wr«4.3) 


(t-22-29,  wr-6.1) 


jorkman  3-8 


Bjorkman  5-11  (wr=2.5) 


single  dominant  freq 


single  dominant  freq _ 


clear  reson.,  not  quite,  miss 


single  dominant  freq _ 


single  dominant  freq 


single  dominant  freq 


multiple  resonances 


rerun  wr-6 


multiple  resonances _ 


reson.  w/dominant 


resonances 


reum  wr-3 


1 

n 

1 

c 

this  is  the  'best*  wr 


resonances 


Table  2.  Summary  of  PIO  Classification  Results 


Configurations 

Results 

14  PlO-prone  Configs 

Bjorkmans  configs  with 

PIOR  consistently  >4 

T-38,  Shuttle,  B-2,  F-8,  YF-22 

S  HAVE  PIOs  (Bjorkman  3-12) 

correct  classification:  13 
wrong  classification:  1 

(Bjorkman  3-12) 

(There  was  a  clear  single  dominant 
frequency.) 

9  not  PI  O-p rone  Configs 

Bjorkman  Configs  with 

PIOR  consistently<4 

correct  classification:  6 
correct  by  choice  of  wn  3 

wrong  classification:  0 

3  maybe  PIO  Configs 
at  least  1  PIOR  >*4 
(  Bjorkman  2-7, 

Bjorkman  3-8, 

Bjorkman  5-11) 

correct  by  choice  of  wn  3 
wrong  classification:  0 

ConfigCPIOR) 

Bj2-7  (4^,2)  PlO-prone  at  3/3  wrs 
Bj3-8  (4,3,4)  PlO-prone  at  1/1  wrs 
Bi5-ll(2.4.S)  PlO-prone  at  2/3  wrs 
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power  (unitsA2/Hz) 
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Resonance  from  Pilot  Input,  F8,  Time=(  8.08  -  20.57)  sec 


Figure  2.  Power  spectrum  showing  a  single  dominant  frequency 
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2-15 


10.00 


Type  I  PIO  Rating 


Type  III  PIO  Rating 


Bunea  HO  -6a y 

Figure  5.  Actual  average  Cooper-Harper  ratings  versus  phase  of  ©(jcorVFsdcor) 

for  Bjorkman  records 
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Figure  6,  The  importance  of  a  single  dominant  frequency  for  PIO 
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Time  (sec) 


Figure  7.  Shuttle  record  showing  significant  time  variation  of  phase  angle 

of  qOcoiO/FgCjcor) 


2- 


B-2 


Figure  8.  B2  record  showing  insignificant  time  variation  of  phase  angle 

of  qCjfflrl/FgCjtOr) 
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Materials  Engineering  &  Industrial  Design 

Abstract 

The  effect  of  initial  microstructure,  hot-working  parameters,  and  post¬ 
processing  heat  treatments  on  Ti-6A1-4V  were  studied.  Included  in  the  studies  were 
optical  metallography,  crystallographic  texture  measurements  using  back-reflection 
pole-figures,  and  micro-texture  determination  using  orientation  image  microscopy. 
The  results  demonstrate  strong  dependency  on  the  processing  parameters,  and  the 
existence  of  micro-texture  even  in  some  of  the  cases  when  overall  the  crytallographic 
orientation  is  random. 
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TEXTURE  AND  MICROSTRUCTURE 
OF  HOT  ROLLED  Ti-6AMV 


Pnina  Ari-Gur 


Introduction: 

Ti-6A1-4V  is  a  high-strength  light-weight  material  that  has  been  in  extensive  use 
for  aerospace  applications.  Its  processing  is  difficult  —  it  is  not  workable  at  low 
temperatures,  while  at  temperatures  above  75%  of  the  solidus  temperature,  its 
developing  microstructure  is  hard  to  control  [1]. 

At  room  temperature  and  up  to  1815°F,  Ti-6A1-4V  has  a  dual  phase  (a+p) 
structure.  The  a-phase  (HCP),  the  dominant  phase  throughout  this  temperature  range, 
is  highly  anisotropic  in  nature.  The  second  phase  (j3)  is  BCC,  and  is  the  single  phase 
present  above  the  transus  temperature  (1815°F).  Numerous  studies  investigated  the 
relationship  between  processing  and  crystallographic  texture  development  in  dual¬ 
phase  titanium  alloys  and  its  effect  on  the  mechanical  properties.  However,  failure 
analyses  done  on  Ti-6A1-4V  parts  often  reveal  that  even  in  the  absence  of  overall 
preferred  orientation  (texture)  in  the  part,  aligned  a  colonies  provide  nucleation  site  for 
internal  fatigue  cracks.  This  phenomenon  may  result  in  a  ten-fold  decrease  in  fatigue 
life  [2,3].  The  aligned  a  colonies  can  be  traced  back  to  the  original  (3  (BCC)  grain  that 
existed  above  the  transus  temperature.  Breaking  the  original  microstructure,  as  well  as 
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elimination  of  the  micro-texture  (local  aligned  colonies),  are  what  thermo-mechanical 
processing  [1]  is  aimed  at. 

The  goal  of  this  work  was  to  study  the  effects  of  initial  microstructure,  assorted 
hot-rolling  parameters,  and  several  post-processing  heat  treatments  on  the  evolution  of 
microstructure,  crystallographic  texture,  and  micro-texture. 

2.  EXPERIMENTAL  PROCEDURES 

2.1  Processing: 

The  as-received  plate  of  Ti-6A1-4V  (1.17”  thick)  was  divided  to  two  groups, 
that  were  heat  treated  differently  to  provide  different  initial  microstructures.  Each  one 
was  heat  treated  in  preparation  for  the  hot  rolling.  These  are  summarized  in  Table  2.1. 
Table  2. 1  Pre-processing  Heat  Treatments 


Group  1 

1950°F 

EMM 

Air  cool 

(P  annealed) 

Time 

Mmm 

15  min 

Group  2 

1725°F 

1950°F 

Water 

(P  quenched) 

Time 

20  min 

15  min 

Quench 

The  1725°F  and  1500°F  treatments,  following  the  15  min  at  1950°F  for  group  1, 
were  intended  to  simulate  the  slow  cooling  of  large  billets. 


To  study  the  development  (or  breakdown)  of  structure  during  the  process, 
specimens  were  hot  rolled  to  two  levels  of  strains: 

•  Light  reduction  (sx  =  0.57,  from  1.17”  to  0.658”) 

•  Heavy  reduction  (sT  =  1.5,  from  1.17”  to  0.27”  ) 
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Three  rolling  temperatures  were  chosen  in  consideration  of  the  {3  transus 
temperature  (1815°F)  and  the  minimum  temperature  where  Ti-6A1-4V  is  still  workable. 
The  rolling  temperatures  were: 

•  1780°F 

•  1700°F 

•  1500°F 

To  guarantee  temperature  uniformity,  specimens  were  placed  in  the  furnace  for 
40  minutes  prior  to  rolling.  The  reduction  per  rolling  pass  was  10%  (except  for  the 
rolling  at  1500°F  that  was  performed  at  5%  reduction  per  pass).  Between  passes,  the 
specimens  were  re-heated  for  3  minutes. 

To  study  the  effect  of  dynamic  versus  static  recrystalliztion,  some  of  the 
specimens  were  heat  treated  following  the  hot-rolling.  These  heat  treatments  are 
summarized  in  Table  2.2.  Group  number  four  (as-hot  rolled)  was  tested 
metallographically  only.  The  heat  treatment  at  1300°F  is  for  relieving  stresses  only  and 
does  not  change  the  structure  or  texture.  It  was  needed  for  the  micro-texture  studies, 
due  to  the  sensitivity  of  the  orientation  imaging  technique. 

Table  2.2  Post-processing  Heat  Treatments 


Temperature 

Time 

Cooling 

1 

2  hours 

Air 

2 

1600°F 

2  hours 

Air 

3 

1750°F 

2  hours 

Air 

4 

As  hot  rolled 

N/A 

N/A 

5 

1300°F 

1  hour 

Air 
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2.2  Testing 


The  microstructure,  crystallographic  texture,  and  micro-texture  of  the  hot-rolled 
and  heat-treated  samples  were  characterized  by  regular  optical  metallography,  x-ray 
diffraction,  and  orientation  imaging  microscopy.  For  the  sake  of  comparison,  the  pre¬ 
rolling,  as  heat  treated  plates  were  tested  too. 

2.2.1  Metallography 

Metallographic  samples  were  sectioned  parallel  to  the  N-R  plane  (perpendicular 
to  the  transverse  direction),  polished  and  etched  for  15  seconds  in  Kroll’s  solution  (3.5 
vol%  HN03,  1.5  vol%  HF,  balance  H20).  Specimens  of  as-hot  rolled,  as  well  as  heat 
treated  (Table  2.2)  were  photographed. 

2.2.2  Texture 

X-ray  diffractometer  (Cu  Ka  radiation)  equipped  with  pole-figure  goniometer 
was  used  for  the  crytallographic  texture  studies.  Basal  plane  (0002)  pole  figures  were 
created  for  samples  cut  and  polished  in  the  R-T  plane  (parallel  to  the  rolling  plane  about 
2.5mm  below  the  surface).  Only  back  reflection  data  were  collected  (maximum  tilt  of 
80°). 

2.2.3  Micro-texture 

Samples  were  cut,  mechanically  polished  and  electro-polished  parallel  to  the  N- 
T  plane  (perpendicular  to  the  rolling  direction).  The  electro-polishing  was  necessary  to 
remove  any  surface  damage  left  after  the  careful  mechanical  polishing  Two  areas  of 
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2mm  x  2mm  each  were  scanned  (1mm  x  4mm  for  the  heavily  rolled  samples),  one  close 
to  the  rolling  surface,  and  the  other  --  at  mid-section). 

Icosahedral  maps  were  plotted  for  the  basal  (0002)  plane.  They  were  based  on 
40,000  points  of  measurement  each.  Each  indexed  point  is  within  0.9°  of  an 
icosahedral  face  (or  else  the  program  determines  that  it  is  unable  to  index  it  and  the 
point  appears  white  on  the  map). 

In  addition  to  the  icosahedral  maps,  pole-figures  (parallel  to  the  N-T  and  to  the 
R-T  plane)  were  drawn  reflecting  the  average  texture  in  each  scanned  area. 


3.  EXPERIMENTAL  RESULTS  AND  DISCUSSION 

3 . 1  Micro-structure 

The  microstructures  of  the  pre-processing  heat  treated  samples  are  shown  in 
Figure  1.  As  expected,  the  P-annealed  sample  (Figure  la)  demonstrates  a  coarse, 
basket-weave  (Widmanstatten)  type  structure,  while  the  P~  quenched  sample  has  a  lot 
finer  microstructure  (Figure  lb). 

Even  relatively  large  deformations  (s  =  1.5)  at  1500°F  resulted  in  a  non-uniform 
microstructure  (Figure  2).  Post-rolling  heat  treatment  at  1500°F  does  little,  if  at  all  to 
change  the  previously  deformed  microstructure  (Figure  3).  At  the  higher  temperatures 
tested  ,  recrystalliztion  occurs,  and  heat  treatment  at  1750°F  following  the  rolling, 
results  in  equiaxed  structure  (Figure  4). 
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At  the  other  end,  rolling  at  1780°F,  resulted  in  non-uniform  deformation  only  at 
small  strains  (e=0.57.  Figure  5a).  Subsequent  deformation  to  s  =  1.5  at  this 
temperature,  resulted  in  a  uniform  microstructure  (Figure  5b).  Post-deformation  heat 
treatment  at  1600°F  merely  coarsened  the  microstructure  of  the  lightly  rolled  specimens 
(Figure  6a),  but  resulted  in  basket-weave  like  structure  for  the  heavily  rolled  specimen 
(Figure  6b).  It  seems  as  if  the  subsequent  deformation  has  triggered  an  earlier 
transformation. 


Figure  la:  Microstructure  of  the  pre-processing  p-annealed  sample. 
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Figure  lb: 


Figure  2 


Microstructure  of  the  pre-processing  p-quenched  Sample. 


Microstructure  of  a  sample  that  was  p-annealed  and  hot-rolled  (e=1.5)  at 
1500°F. 
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Microstructure  of  a  sample  that  was  (3-annealed  and  hot-rolled  (s=1.5)  at 
1500°F  and  subsequently  heat  treated  at  1500°F  for  two  hours. 


Microstructure  of  a  sample  that  was  (3-annealed  and  hot-rolled  (s  =  1.5)  at 
1500°F  and  subsequently  heat  treated  at  1750°F  for  two  hours 
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Microstructure  of  a  sample  that  was  p-annealed  and  lightly  hot- 
rolled  (e=0.57)  at  1780°F. 


Microstructure  of  a  sample  that  was  P-annealed  and  heavily  hot 
rolled  (s=1.5)  at  1780°F. 


3-11 


Microstructure  of  a  sample  that  was  p-annealed  and  lightly  hot- 
rolled  (s=0.57)  at  1780°F  and  subsequently  heat-treated  at 
1600°F  for  two  hours. 


Microstructure  of  a  sample  that  was  P-annealed  and  heavily  hot 
rolled  (s  =  1.5)  at  1780°F  and  subsequently  heat-treated  at 


1600°F  for  two  hours. 


3.2  Crystallographic  Texture 


Pole-figures  of  the  (0002)  plane  for  the  p-annealed  p-quenched  plates  are 
presented  in  Figure  7.  Both  demonstrate  preferred  orientation.  Although  the  large 
grain-size  may  make  the  texture  appear  stronger  than  it  is,  its  consistency  and  four-fold 
symmetry  serve  as  a  proof  of  its  existence.  The  two  texture  components  both  of  them 
demonstrate,  are  in  the  transverse  direction,  and  at  about  45°.  Rolling  the  plate  at 
1780°F  (Figure  8)  retained  the  TD  texture,  but  eliminated  the  45°  and  prompted  an  RD 
texture  to  develop.  Based  on  the  literature  [4],  the  appearance  of  the  RD  texture  was 
not  expected  for  deformation  below  the  transus.  It  is  possible  that  the  oxygen  levels 
(resulting  from  heat  treatment  in  air)  in  our  samples  were  different  than  the  literature 
and  caused  a  shift  in  their  location  in  the  phase  diagram. 
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Figure  7a:  Pole-figures  of  the  (0002)  plane  for  the  P-annealed  plate. 
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3.3  Micro-Texture 


The  icosahedral  plots  of  the  (3-annealed  sample  (Figure  9a,b)  reveal  large 
clusters  of  similar  orientation.  This  result  is  not  surprising  considering  the  large  grain- 
size.  Rolling  at  1780°F  reduces  significantly  the  size  of  the  clusters  (Figure  10). 
Subsequent  heat  treatments  did  not  increase  the  cluster  size,  but  tended  to  foster  a 
pancake-like  structure  (Figure  11)  sample  heat  treated  at  1600°F.  This  flattened 
morphology  follows  the  original  P-grains.  This  result  is  particularly  interesting 
considering  the  basket-weave  like  microstructure  of  the  material  at  this  stage  (see 
section  3.1  and  Figure  6b). 


Figure  (9-a)  Icosahedral  plots  of  the  P-annealed  sample  at  the  center  of  the  cross 
section. 
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Fisure  (9-b)  Icosahedral  plots  of  the  (3-annealed  sample  near  the  rolling  surface 


Icosahedral  plots  of  a  specimen  rolled  at  1780°F. 
(a)  At  the  center  of  the  cross  section. 


(b)  Near  the  rolling  surface 


Figure  11  (a)  Icosahedral  plots  of  a  specimen  rolled  at  1780°F  then  heat  treated  at 
1600°F.  Scan  taken  from  the  area  near  the  rolling  surface. 


Figure  11  (b)  Icosahedral  plots  of  a  specimen  rolled  at  1780°F  then  heat  treated  at 
1600°F.  Scan  taken  from  the  area  near  the  center  of  the  cross  section 
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SUMMARY: 


1.  The  samples  that  underwent  heat  treatment  only,  demonstrated  large  clusters  with 
similar  orientation.  The  microstructure  is  a  typical  basket- weave. 

2.  Light  rolling  the  heat-treated  plates  resulted  in  a  highly  non-uniform  deformation 
throughout  the  temperature  range  studied. 

3.  Heavy  rolling  close  to  the  transus  temperature  resulted  in  a  uniform  deformation. 

4.  Heat  treating  the  previously  hot  rolled  samples  may  have  fostered  a  pancake-like 
arrangement  of  the  crystallographic  orientations,  that  may  be  correlated  with  the 
original  P-grains. 

RECOMMENDED  FUTURE  STUDIES: 

1.  Orientation  imaging  microscopy  studies  of  the  evolution  of  micro-texture  in  samples 
that  were  p-quenched  and  subsequently  hot  rolled  and  heat  treated. 

2.  Complete  studies  of  samples  rolled  at  1725°F  and  1750°F.  From  the  results 
reported  here,  it  seems  as  if  the  1780°F  temperature  resulted  with  substantial 
transformation  to  p,  testing  of  samples  rolled  at  lower  temperatures,  will  enable  us 
to  study  the  effect  of  a-phase  deformation  at  elevated  temperatures. 
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ANALYSIS  OF  FATIGUE  CRACK  GROWTH  RATE  DATA  FOR  ALUMINUM  ALLOYS 

DAMAGED  BY  PRIOR  CORROSION 


J.D.  Baldwin 
Assistant  Professor 

School  of  Aerospace  &  Mechanical  Engineering 
University  of  Oklahoma 

Abstract 

The  differences  in  fatigue  crack  propagation  behavior  in  7075-T6  aluminum  alloy  between 
specimens  damaged  by  corrosion  and  those  in  a  baseline  (noncorroded)  condition  are  examined 
using  a  statistical  hypothesis  test.  Middle  tension  plate  specimens  were  corroded  on  one  side  in  the 
laboratory  and  subjected  to  constant  load  amplitude  (K-increasing)  tests  in  dry  and  moist  air 
environments.  The  results  show  that  there  is  a  statistically-significant  increase  in  fatigue  crack 
growth  rates  in  the  corroded  material  for  stress  intensity  values  based  on  nominal  thickness.  When 
the  specimen  thicknesses  were  reduced  to  account  for  corrosion  material  loss,  the  difference 
disappeared  in  most  of  the  instances  examined  here.  In  one  case,  the  thickness  correction  was  found 
to  predict  fatigue  crack  growth  rates  that  were  slower  in  the  corroded  material.  This  anomalous 
result  suggests  that  full  thickness  corrections  may  not  be  appropriate  at  higher  stress  intensity 
ranges. 
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INTRODUCTION 

In  fracture  mechanics-based  structural  durability  analysis,  one  of  the  important  material 
response  parameters  is  the  fatigue  crack  growth  rate  (FCGR),  characterized  by  da/dN  versus  AK 
data.  Here  a  is  the  crack  length,  N  is  the  number  of  load  cycles  and  AK  is  the  stress  intensity 
factor  range  for  the  load  cycle.  Although  the  fatigue  crack  growth  rate  data  for  a  material  can 
be  presented  in  the  form  of  an  empirical  equation  (e.g.,  the  Paris  equation),  it  is  typically  given 
graphically.  As  with  other  types  of  fatigue  data,  crack  growth  experiments  display  considerable 
scatter.  Usually  the  variability  in  FCGR  data  is  viewed  as  a  nuisance  and  little  account  is  taken 
of  it.  Attempts  have  been  made,  however,  at  incorporating  the  variability  into  probabilistic  crack 
growth  models.  Specifically,  Virkler,  et  al.  [1979]  and  Ghonem  and  Dore  [1987]  developed 
experimental  data  that  clearly  illustrates  the  crack  growth  rate  variability  and  scatter.  Recent 
analyses  by  Yang  and  Manning  [1996]  and  Maymon  [1996]  illustrate  current  developments  in 
stochastic  crack  growth  analysis.  The  data  scatter  also  becomes  of  great  interest,  however,  when 
one  is  faced  with  the  task  of  comparing  two  sets  of  FCGR  data  to  detect  whether  they  are 
substantially  equal  or  not.  Such  an  analysis  is  particularly  important  when  differences  in  cracking 
behavior  are  anticipated  due  to  experimental  treatments  such  as  surface  condition,  environment 
and  stress  ratio. 

The  analysis  developed  here  has  come  about  within  the  context  of  the  U.S.  Air  Force  Aging 
Aircraft  program.  The  USAF’s  fleet  of  cargo  and  tanker  aircraft  are  being  operated  well  beyond 
their  original  design  life  and  structural  damage  mechanisms  that  were  not  considered  in  the  design 
process,  specifically  corrosion,  are  becoming  of  increasing  interest  in  assessing  the  continued 
durability  of  these  aircraft.  It  is  now  well-known  that  the  average  ages  of  commercial  and  military 
aircraft  fleets  are  increasing,  prompting  an  intense  examination  of  the  tradeoffs  between  economic 
efficiency  and  airworthiness.  As  the  fleets  age,  corrosion  damage  and  its  impact  on  airframe 
durability  become  of  particular  concern.  As  Schutz  noted  [1995],  fatigue  strength  and  corrosion 
parameters  combine  in  a  synergistic  manner  that  has  not  yet  been  completely  described.  The 
technical  challenges  facing  operators  of  aging  aircraft  came  into  sharp  focus  in  April  1988  when 
Aloha  Airlines  Flight  243  experienced  a  catastrophic  failure  of  the  forward  fuselage.  Although  the 
cause  of  the  Aloha  structural  failure  was  formally  given  as  multiple  site  damage,  there  was  a 
component  of  corrosion  damage  involved.  Corrosion-induced  aircraft  accidents  have  occurred 
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throughout  the  history  of  aviation.  Campbell  and  Lahey  [1984]  reported  that,  since  1927,  there  had 
been  over  60  accidents  worldwide  at  least  partly  attributable  to  corrosion  damage.  In  a  more  recent 
study,  Hoeppner,  et  al.  [1995]  update  Campbell's  statistics,  reporting  that  since  1975  there  have  been 
nearly  700  domestic  incidents  and  accidents  in  which  corrosion  was  at  least  a  contributing  factor. 

Hendricks  [1991]  estimated  the  cost  of  a  major  commercial  aircraft  overhaul  at  $2-20 
million.  Compared  with  the  approximately  $50-100  million  required  for  a  new  aircraft,  the 
economic  incentive  for  operating  older  planes  is  apparent.  For  operators  of  large  fleets,  however, 
the  economic  burden  of  keeping  aging  aircraft  operational  is  becoming  prohibitive.  In  1994,  a  U.S. 
Air  Force  study  showed  that  the  maintenance  costs  of  repairing  corrosion  damage  alone  had  reached 
the  $1-3  billion  per  year  level  [Chang,  1995].  Since  the  Aloha  accident,  the  issues  involved  in 
operating  aging  aircraft  fleets  have  received  increased  attention  from  agencies  such  as  the  U.S.  Air 
Force,  U.S.  Navy,  National  Aeronautics  and  Space  Administration  and  Federal  Aviation 
Administration.  It  is  important  to  realize  that  corrosion  damage,  one  of  the  primary  sources  of 
aircraft  structural  degradation,  is  not  accounted  for  in  the  design  structural  analysis,  and  thus  in  the 
inspection  intervals,  of  any  major  aircraft  system. 

The  U.S.  Air  Force's  fleet  of  nearly  700  C/KC-135  tanker  aircraft,  first  designed  in  the  mid- 
1950's,  now  has  an  average  age  of  more  than  30  years  with  none  newer  than  24  years  old  [Chang, 
1995].  By  virtue  of  its  vintage,  the  C/KC-135  was  designed  before  the  damage  tolerant  design 
philosophy  was  mandated  for  all  USAF  aircraft  and  therefore,  does  not  include  features  such  as 
crack  arrestors  commonly  found  in  more  recent  designs.  Corrosion  of  the  aluminum  fuselage  panels 
and  wing  skins  has  been  observed  with  increasing  regularity,  particularly  in  lap  joints  and  around 
fastener  holes.  In  fact,  corrosion  has  proven  to  be  an  insidious  enemy,  often  found  only  after 
disassembly  of  the  structure,  despite  the  use  of  nondestructive  inspection  systems.  These  tendencies 
are  observed  not  only  in  the  C/KC-135  fleet,  but  in  other  aging  weapons  systems  as  well.  Because 
of  the  prohibitive  cost  to  replace  the  fleet,  these  aircraft  are  considered  a  national  asset  and,  as  such, 
will  be  expected  to  serve  well  into  the  next  century  [Lincoln,  1 995].  In  fact,  it  is  anticipated  that  the 
C/KC-135's  will  be  one  of  the  last  of  the  current  USAF  transport  platforms  to  be  retired.  Because 
there  is  no  plan  to  replace  these  aircraft  in  the  near  future,  the  effect  of  corrosion  on  structural 
integrity  must  be  quantified  and  incorporated  into  the  fleet  maintenance  procedures. 

Regarding  corrosion,  the  relevant  characteristic  of  the  operation  of  such  aircraft  is  that  they 
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spend  most  of  their  time  outdoors  on  the  ground  and  only  a  few  hundred  hours  per  year  in  flight. 
Therefore,  unlike  the  classical  case  of  corrosion  fatigue,  where  corrosion  and  fatigue  processes 
evolve  concurrently,  to  a  first  approximation  the  corrosion  and  fatigue  damage  processes  can  be 
assumed  to  be  uncoupled  in  the  case  of  military  transport  aircraft.  Only  recently  have  fracture 
mechanics-based  experiments  designed  to  quantify  the  fatigue  response  of  corroded  metal  begun  to 
appear.  Chubb,  et  al.  [1991a,  1991b,  1995]  examined  the  effect  of  exfoliation  corrosion  on  the 
fatigue  crack  growth  rates  of  2024-T351  and  7178-T6  alloys.  Chubb's  data  shows  that  at  low  AK 
levels,  the  corroded  material  experienced  crack  accelerations  of  up  to  five  times  over  noncorroded 
material.  Scheuring  and  Grandt  [1994, 1995]  used  2024-T3, 7075-T6  and  7178-T6  materials  taken 
from  retired  C/KC-135  aircraft  corroded  in  service.  Scheuring  notes  that  the  lightly  corroded 
material  shows  little  difference  in  crack  growth  behavior  compared  to  noncorroded  material.  Their 
data  also  shows  that  the  acceleration  in  crack  growth  tends  to  diminish  as  the  stress  ratio  R  increases. 
While  these  two  research  teams  report  valuable,  although  limited,  results,  the  conclusions  drawn  are 
based  on  qualitative  comparison  of  the  data.  Lacking  a  statistical  analysis,  it  is  difficult  to  separate 
the  effects  of  the  test  parameters  such  as  corrosion  damage  level,  stress  ratio  and  test  environment. 

Because  of  the  lack  of  a  suitable  body  of  fatigue  data  on  aluminum  alloys  damaged  by  prior 
corrosion,  a  material  characterization  program  was  begun  by  the  USAF  to  develop  some  basic 
fatigue  crack  growth  rate  data  on  specimens  with  laboratory-grown  corrosion  damage.  The  goal  of 
the  program  was  to  compare  the  crack  growth  behavior  of  corrosion-damaged  material  with  the 
baseline,  noncorroded  material.  From  that  data,  conclusions  would  be  drawn  about  the  impact  of 
corrosion  damage  on  fatigue  performance;  subsequent  policy  decisions  regarding  continued 
airworthiness  and  any  necessary  modifications  to  inspection  intervals  and  procedures  would  then 
be  based  on  a  suitably  large  body  of  experimental  data. 

The  statistical  analysis  reported  here  was  conceived  to  examine  two  issues  relating  to  the 
fatigue  cracking  behavior  of  metal  damaged  by  prior  corrosion.  First,  the  existence  of  any 
corrosion-based  crack  acceleration  had  not  previously  been  addressed  rigorously.  As  noted  above, 
Chubb,  et  al.  [1991a,  1991b,  1995]  reported  crack  accelerations  of  up  to  five  times  in  corroded 
material.  On  the  other  hand,  Scheuring  and  Grandt  [1995]  observed  little  or  no  differences  in  the 
da/dN  versus  AK  data  for  “lightly”  corroded  material.  It  is  important  to  note  that  both  conclusions 
were  drawn  without  resorting  to  statistical  analysis  and,  rather,  were  based  on  visual  observation  of 
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the  data.  Given  the  potential  economic  and  safety  ramifications  of  an  incorrect  assessment  of  the 
relative  performance  of  baseline  and  corroded  material,  such  subjective  analysis  is  inadequate.  The 
second  issue  of  interest  is  whether  or  not  modeling  the  corrosion  damage  as  simply  a  thickness  loss 
from  the  specimen  is  sufficient  to  describe  any  differences  in  crack  growth  rates;  this  hypothesis  was 
initially  proposed  by  Doerfler,  et  al.  [1994],  If  a  significant  difference  persists  after  the  thickness 
correction,  one  might  be  alerted  to  the  presence  of  a  secondary  process  effecting  crack  growth  rates, 
perhaps  acting  on  a  microstructural  scale.  Again  relying  on  a  subjective  assessment,  Scheuring  and 
Grandt  reported  that  correcting  their  AK  values  for  measured  thickness  loss  did  not  alter  the 
conclusion  that  there  was  no  effect  due  to  corrosion.  In  cases  such  as  these,  a  statistical  analysis  of 
the  data  will  allow  us  to  address  these  questions  in  a  rigorous  way,  avoiding  the  subjective  nature 
of  visual  data  inspection. 

In  this  report,  a  statistical  analysis  is  described  that  has  been  used  to  address  the  equality  of 
two  or  more  fatigue  crack  growth  rate  experiments.  We  begin  with  the  body  of  FCGR  da/dN  vs.  AK 
data  segregated  into  two  groups:  the  baseline  group  (i.e.,  specimens  without  the  experimental 
treatment  of  interest)  and  the  specimens  subjected  to  the  experimental  treatment.  Next,  for  each 
experiment,  a  curve  is  fit  through  the  FCGR  data  using  a  polynomial  kernel.  Using  the  curve  fits, 
we  can  then  estimate  the  value  of  da/dN  at  any  value  of  AK  of  interest  for  each  experiment  and 
compute  the  mean  value  of  each  of  the  two  groups.  A  Student  t-test  is  used  to  test  the  null 
hypothesis  that  the  mean  da/dN  for  each  of  the  two  groups  is  equal.  If  we  are  led  to  reject  the  null 
hypothesis,  we  then  conclude  that  the  experimental  treatment  has  an  effect  on  the  fatigue  cracking 
behavior  of  that  material.  Examples  are  given  illustrating  the  analysis  for  aluminum  alloy  specimens 
damaged  by  prior  corrosion  compared  to  specimens  without  corrosion. 

EXPERIMENTAL  DATA 

To  motivate  and  illustrate  the  comparative  statistical  analysis,  we  will  use  a  series  of  test  data 
developed  in  the  USAF  Round-Robin  Corrosion  Fatigue  Program.  The  round  robin  test  program 
was  earned  out  to  explore  the  effect  of  prior  corrosion  on  fatigue  behavior  of  representative  aircraft 
aluminum  alloys  by  comparing  fatigue  crack  growth  rates  in  corroded  and  the  baseline 
(noncorroded)  specimens,  ^-increasing  fatigue  crack  growth  rate  tests  were  conducted  according 
to  ASTM  E  647  on  samples  of  2024-T3, 2024-T4, 7075-T6  and  7178-T6  aluminum  harvested  from 
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retired  USAF  KC-135  aircraft.  All  specimens  were  1.75  inches  wide  and  0.062  inches  thick 
(nominally)  in  the  center-cracked  plate  configuration  with  EDM  crack  starter  notches.  In  one  half 
of  the  specimen  population,  corrosion  was  induced  on  one  side  of  the  specimen  by  exposing  them 
to  an  ASTM  B1 17  3.5%  NaCl  fog  solution  until  the  damage  was  deemed  to  be  “severe”;  no  other 
a  priori  assessment  of  the  corrosion  damage  was  made.  The  other  half  of  the  specimen  population 
remained  in  an  as-delivered  (i.e.,  baseline  or  noncorroded)  condition.  In  addition  to  the  material 
condition  (baseline  vs.  corroded),  the  additional  experimental  parameters  stress  ratio  (R  =  0.05,  R 
=  0.50)  and  humidity  (<  15%  R.H.,  >  85%  R.H.)  were  considered.  The  test  matrix  illustrating  the 
multiple  replicate  full  factorial  design  used  in  the  testing  is  given  in  Table  1 ;  the  actual  test  progress 
to  date  is  found  in  Table  2.  To  assist  future  researchers  in  identifying  data  sets  with  common 
characteristics,  an  index  to  the  tested  specimens  is  compiled  in  Table  3.  In  what  follows,  a  test 
series  will  be  considered  to  be  all  of  the  specimens  exposed  to  a  given  stress  ratio-humidity 
combination.  The  round  robin  testing  was  conducted  in  five  different  laboratories,  both  within  and 
outside  the  Air  Force.  Originally,  the  intent  of  the  test  program  was  to  test  six  replicates  of  each  test 
series,  but  organizational  difficulties  prevented  more  than  four  corroded  specimens  from  being  tested 
in  any  of  the  conditions. 

In  all  cases,  crack  lengths  were  measured  using  optical  microscopes  on  micrometer  slides. 
The  recorded  a  versus  N  data  were  processed  into  da/dN  vs.  AK  using  secant  interpolation  and  the 
stress  intensity  solution  for  the  ASTM  middle  tension  specimen 
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In  this  expression  AP  is  the  cycling  load  range,  B  is  the  specimen  thickness,  W  is  the  specimen 
width,  and  a=  a/W.  Any  data  points  that  exceeded  the  ASTM-specified  maximum  plastic  zone  size 
were  removed  from  further  analysis  as  invalid. 

Since  the  appearance  of  the  Paris  expression  summarizing  crack  growth  rates  [Paris  and 


Erdogan,  1960] 


da/dN  =  CAKm  (2) 

it  has  been  traditional  to  show  these  data  on  logarithmic  coordinates.  Therefore,  we  will  use  the  data 
in  log  da/dN  -  log  AK  form.  For  the  four  experimental  test  series,  the  crack  growth  rate  data  are 
shown  in  Figure  1.  All  of  the  data  here  is  based  on  using  the  actual  specimen  thicknesses  making 
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no  deduction  for  corrosion-induced  material  loss.  In  this  program,  two  test  conditions  contributed 
to  the  relatively  narrow  data  domain.  First,  the  scope  of  the  data  was  restricted  at  low  AK  levels  by 
not  being  from  decreasing  Attests.  Also,  the  relatively  narrow  test  specimens  could  not  develop  long 
enough  cracks  to  result  in  high  AfCs  before  reaching  the  plastic  zone  size  limits  of  LEFM,  thus  data 
at  high  AK  levels  have  been  restricted.  In  Figure  1 ,  there  is  certainly  the  appearance  of  crack 
acceleration  due  to  prior  corrosion,  particularly  at  lower  AK  levels.  It  is  interesting  to  note  that  the 
baseline  and  corroded  data  seem  to  coincide  at  the  highest  AK  levels  reached  during  testing. 
Although  it  has  not  been  examined  rigorously,  this  effect  is  presumably  due  to  the  overriding 
influence  of  the  crack  tip  driving  force  as  the  crack  nears  instability.  The  question  remains, 
however,  as  to  whether  the  corroded  material  behavior  is  really  different  from  the  baseline,  or  are 
the  observed  differences  simply  due  to  experimental  data  scatter.  In  the  analysis  that  follows,  we 
develop  and  demonstrate  a  statistical  hypothesis  test  that  addresses  this  concern. 

STATISTICAL  ANALYSIS 

In  curve  fitting  equations  of  the  form  y  =  mx  +  b  (e.g.,  the  Paris  equation  in  logarithmic 
coordinates),  the  statistics  of  the  fitting  parameters  m  and  b  can  be  computed  for  each  data  set 
[Draper  and  Smith,  1981].  A  Student  /-test  can  then  be  run  to  identify  presumably  equal  data  sets. 
Such  an  analysis  would,  in  this  case,  give  only  a  “global”  determination  in  that  the  overall 
equivalence  throughout  the  relevant  AK  range  is  tested.  Our  goal  was  to  implement  a  “local” 
analysis  capable  of  identifying  AK  ranges  where  the  FCGR’s  were  equal.  Thus,  the  polynomial- 
based  global-local  method  described  here,  using  a  “global”  curve  fit  to  make  “local”  estimates,  was 
found  to  be  suitable  for  our  purposes. 

The  first  step  in  the  statistical  analysis  is  to  summarize  each  experimental  data  set  (the  crack 
length  versus  cycles  data  for  an  individual  specimen)  by  a  curve  fit  of  the  FCGR  data.  By 
establishing  a  curve  fit  for  each  experiment,  we  can  then  interpolate  between  data  points  and 
estimate  da/dN  at  convenient  values  of  AK,  rather  than  at  the  actual  values  that  will  not  coincide  in 
every  experiment.  There  have  been  many  different  models  used  to  describe  this  type  of  data, 
ranging  from  the  simple  original  Paris  equation,  to  nonlinear  models  such  as  the  hyperbolic  sine 
model  [Miller  and  Gallagher,  1981]  given  by 
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(3) 


given  by 

log  da/dN  =  kQ  +  kx  (logAK)  +  ^(logAiQ2  +  ^  (log  ATT)3  +  fc4(logAK)4  (6) 

This  equation  form  does  a  good  job  of  capturing  the  sigmoidal  shape  of  the  FCGR  data  (something 
the  log-linear  Paris  equation  cannot  do)  and  has  the  computational  advantage  of  being  a  linear 
regression  calculation,  as  opposed  to  the  more  problematic  nonlinear  regression  required  for 
relationships  such  as  Equations  (3  and  5).  It  should  be  noted,  that  Equation  (6)  is  not  being 
recommended  as  a  model  for  general  description  of  FCGR  data;  it  is  used  here  because  we  have 
found  that  it  does  a  satisfactory  job  of  describing  fatigue  crack  growth  experimental  data.  Other 
expressions  have  been  shown  to  be  superior  for  recovering  crack  lengths  by  integration  [e.g.,  Miller 
and  Gallagher,  1981],  but  that  was  not  the  goal  of  this  investigation.  To  illustrate  the  adequacy  of 
the  quartic  polynomial  for  describing  da/dN  versus  AK  data,  the  average  coefficients  of 
determination  for  each  of  the  four  test  series  are  summarized  in  Table  4.  These  values  indicate  that 


the  model  is  a  reasonable  reflection  of  the  data  in  the  intervals  where  the  data  exist.  Whereas  the 
curve  fits  can  be  expected  to  provide  good  interpolation  between  data  points,  they  clearly  cannot  be 
used  to  extrapolate  beyond  the  domain  of  the  data. 

For  this  analysis,  the  quartic  curve  fits  were  used  to  estimate  the  da/dN  values  for  each 
experiment  (specimen)  at  AK=  2, 4, 6,  ...  20  ksiVinch.  Because  the  starting  stress  intensity  range 
was  3-5  ksiVinch  in  these  experiments,  the  AK  =  2  ksiVinch  values  were  not  used  in  the  statistical 
analysis.  Recall  also,  that  at  the  higher  AK  ranges,  the  data  was  censored  to  remove  points 
representing  invalid  plastic  zone  sizes  according  to  ASTM  E  647.  With  two  experimental  data  sets 
(baseline  and  corroded)  established  at  a  given  value  of  AK,  the  issue  of  whether  the  data  sets  can  be 
assumed  equivalent,  i.e.,  representative  of  a  single  population,  can  be  addressed.  The  following 
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statistical  hypothesis  has  been  posed: 


HO:  At  the  a=  0.01  significance  level,  the  mean  baseline  material  da/dN  is  equal  to  the  corroded 

material  mean  da/dN  at  a  given  AK 

To  explore  this  question,  an  impaired  Student  /-test  was  run  on  each  AK  group.  If  the  null 
hypothesis  was  rejected  based  on  the  data,  we  concluded  that  there  was  a  statistically-significant 
difference  between  the  baseline  and  corroded  material  crack  growth  rates.  If  we  fail  to  reject  the 
null  hypothesis,  we  must  accept  the  possibility  that  there  is  no  difference. 

ANALYSIS  RESULTS 

For  purposes  of  the  statistical  analysis,  the  four  7075-T6  test  series  summarized  in  Table  2 
were  considered  separately;  no  attempt  was  made  at  comparing  differences  due  to  stress  ratio  or 
relative  humidity.  In  Table  5,  the  ranges  of  AK  are  listed  where  statistically-significant  differences 
in  the  crack  growth  rates  were  observed.  Consulting  Figure  1,  we  can  see  that  the  regions  of 
statistically-significant  difference  can  be  easily  correlated  to  differences  in  the  plotted  data.  Also, 
comparing  Figure  1  and  Table  5,  we  see  that  the  tendency  for  differences  in  da/dN  to  disappear  at 
higher  ^levels  is  confirmed  by  the  statistical  analysis.  Therefore,  based  on  the  original  specimen 
thickness,  we  conclude  that  these  data  show  a  statistically-significant  difference  between  baseline 
and  corroded  material,  primarily  at  low  to  moderate  AK  levels. 

It  was  noted  above  that  a  second  reason  for  developing  and  implementing  the  current 
statistical  analysis  of  FCGR  data  was  to  address  the  validity  of  modeling  corrosion  as  mechanical 
damage.  Doerfler,  et  al.  [1994]  originally  proposed  this  simplification  without  supporting  data. 
Scheming  and  Grandt  [1995],  based  on  a  subjective  analysis,  concluded  that,  for  “light”  corrosion, 
differences  in  FCGR  were  accounted  for  by  making  an  appropriate  thickness  correction.  To  explore 
this  issue  further,  another  set  of  statistical  analysis  runs  were  made  on  the  7075-T6  data.  A  post 
fracture  microscopic  examination  of  the  two  shortest-life  specimens  revealed  that,  on  the  fracture 
surface,  the  thickness  lost  to  corrosion  was  between  12%  and  16%  of  the  nominal  thickness.  These 
thickness  reduction  figures  represent  average  values  on  the  specimen  cross  section.  More  detailed 
analysis  may  be  able  to  give  more  localized  thickness  data,  thus  allowing  improved  estimates  of  the 
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instantaneous  value  of  AK  at  various  crack  positions,  but  it  is  felt  that  this  added  sophistication  will 
not  affect  the  engineering  accuracy  of  the  material  characterization.  Using  the  16%  thickness 
reduction  figure  as  representative  (and  recalling  that  such  values  were  not  available  for  all  the 
specimens),  each  specimen  thickness  was  reduced  by  16%  and  modified  AK  s  were  computed  for 
each  data  point.  The  resulting  da/dN versus  AK  are  plotted  in  Figure  2.  In  this  Figure,  it  can  be  seen 
that  the  corroded  material  data  now  lies  on  top  of  the  baseline  data  throughout  a  much  larger  range 
of  AK.  In  fact,  as  Table  5  shows,  the  regions  of  statistically-significant  differences  for  the  reduced 
thickness  case  effectively  disappear.  The  exception  to  this  trend  is  in  the  R  =  0.05,  <  15%  R.H.  data 
set,  where  the  corroded  material  data  are  seen  to  lie  below  the  baseline  data.  In  this  case,  the 
thickness  correction  has  resulted  in  corroded  material  FCGR’s  that,  as  shown  in  Table  5,  are 
significantly  slower  than  the  baseline  material.  This  result  is  unexpected  and  is  probably  not  correct 
physically.  Under  no  circumstances  would  we  expect  corrosion  damage  to  retard  crack  growth. 
This  test  series  suggests  that  the  thickness  correction  should  be  fully  applied  at  higher  AK  levels. 
In  spite  of  this  unusual  occurrence,  the  other  test  series  are  found  to  confirm  the  assumption  that 
thickness  corrections  are  sufficient  to  explain  FCGR  increases  in  the  presence  of  corrosion  damage. 

CONCLUSIONS 

The  statistical  analysis  presented  here  has  been  shown  to  be  useful  in  assessing  the  equality 
of  fatigue  crack  growth  rate  data.  It  was  implemented  to  determine  whether  the  experimental 
treatment  of  corrosion  damage  was  causing  significantly  increased  crack  growth  rates  over  a 
baseline  (noncorroded)  material.  Analysis  of  experimental  data  on  aluminum  alloy  7075-T6 
revealed  that,  using  the  nominal  specimen  thicknesses,  the  corroded  material  had  accelerated  crack 
growth  rates,  especially  at  low  AK  levels.  At  high  levels,  the  baseline  and  corroded  data  tended  to 
coincide.  When  the  corroded  specimens  were  corrected  for  a  16%  corrosion-induced  thickness  loss, 
the  crack  growth  rates  were  found  to  be  statistically  equivalent  over  much  wider  AK  ranges.  This 
finding  represents  the  first  objective  evidence  that  corrosion  damage  can  be  modeled  as  simply  a 
thickness  loss. 
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Table  1:  KC-135  Corrosion  Fatigue  Round  Robin  Planned  Test  Matrix 


Material 

Baseline  (N< 

R  =  0.05 

an-comoded) 

R  =  0.50 

Com 

/?  =  0.05 

oded 

=  0.50 

Totals 

<15% 

>85% 

<15% 

>85% 

<15% 

>85% 

<15% 

>85% 

2024-T3 

6 

6 

6 

6 

6 

6 

6 

6 

48 

2024-T4 

6 

6 

6 

6 

6 

6 

6 

6 

48 

7075-T6 

6 

6 

6 

6 

6 

6 

6 

6 

48 

7178-T6 

6 

6 

6 

6 

6 

6 

6 

6 

48 

Totals 

24 

24 

24 

24 

24 

24 

24 

24 

192 

Totals  exclude  72  spare  and  4  low  frequency  (0.1  Hz.)  specimens 

Table  2:  Accomplished  Test  Matrix 


Material 

Baseline  (N< 

R  =  0.05 

an-corroded) 

7?  =  0.50 

Com 

/?  =  0.05 

oded 

/?  =  0.50 

Totals 

<15% 

>85% 

<15% 

>85% 

<15% 

>85% 

<15% 

>85% 

2024-T3 

6 

6 

5 

6 

3 

4 

3 

4 

37 

2024-T4 

7 

6 

6 

6 

1 

2 

1 

2 

31 

7075-T6 

6 

6 

6 

6 

4 

4 

4 

4 

40 

7178-T6 

6 

6 

6 

6 

2 

4 

2 

3 

35 

Totals 

25 

24 

23 

24 

10 

14 

10 

13 

143 
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Table  3:  Directory  of  USAF  Round  Robin  Test  Specimens  by  Test  Senes 


2024-T3 

Baseline 

R  =  0.05 

<  15%R.H. 

202413-29,34,35,37,52,58 

>  85%  R.H. 

2024T3-33,38,39,44,48,51 

R  =  0.50 

>  15%  R.H. 

2024T3-28,32,40,4 1 ,45,59 

<  85%  R.H. 

2024T3-46,50,53,54,55,60 

Corroded 

R  =  0.05 

<  15%  R.H. 

2024T3-65, 66,97 

>  85%  R.H. 

2024T3-6 1 ,62,9 1 ,92 

R  =  0.50 

<  15%  R.H. 
2024T3-67,98,99 

>  85%  R.H. 

2024T3-63.64, 93,94 


7075-T6 

Baseline 

R  =  0.05 

<  15%  R.H. 

7075T6-1 5,20,24,30,32,33 

>  85%  R.H. 

7075T6-1 8,23,27,28,29,31 

R  =  0.50 

<  15%  R.H. 

7075T6-06, 1 4, 1 9,2 1 ,25,26 

>  85%  R.H. 

7075T6-02,04, 07, 11,17,22 

Corroded 

R  =  0.05 

<  1 5%  R.H. 

7075T6-3  8,39,56,68 
>85%  R.H. 

7075T6-34, 35,66,70 

R  =  0.50 

<15%  R.H. 

7075T6-40, 4 1,69,73 
>85%  R.H. 
7075T6-37,44,67,71 


2024-T4 

Baseline 

R  =  0.05 

<  15%  R.H. 

2024T4-42,48,52,61,64,68,69 

>  85%  R.H. 

2024T4-47, 53,57,62,63, 67 

R  =  0.50 

<15%  R.H. 

2024T4-4 1 ,46,50,5 1 ,54,65 

>  85%  R.H. 

2024T4-43,44,45,60,72,73 

Corroded 

R  =  0.05 

<  1 5%  R.H. 

2024T4-36 

>  85%  R.H. 

2024T4-30,3 1 

R  =  0.50 

<  1 5%  R.H. 

2024T4-37 

>  85%  R.H. 

2024T4-32,33 


7178-T6 

Baseline 

R  =  0.05 

<15%  R.H. 

7 178T6-0 1,02,28,29, 65, 74 
>85%  R.H. 

7178T6-05,06,32,33,67,76 

R  =  0.50 

<  15%  R.H. 

7 1 78T6-03,04,30,3 1 ,66,75 
>  85%  R.H. 

71 78T6-07,08,35,36,68,77 


Corroded 

R  =  0.05 

<  15%  R.H. 
7178T6-60,62 

>  85%  R.H. 

7 1 78T6-24,47,57,6 1 

R  =  0.50 

<  1 5%  R.H. 
7178T6-63,64 

>  85%  R.H. 
7178T6-48,58,59 
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Table  4:  Mean  Coefficients  of  Determination  From  Quartic  Curve  Fits  of  da/dN  vs.  AK  Data 


Mean  Coefficient  of  Determination,  r2 

Test  Series 

Baseline 

No  Thickness  Corr. 

16%  Thickness  Corr. 

R  =  0.05,  <  15%R.H. 

0.9900 

0.9355 

0.9355 

R  =  0.05,  >  85%  R.H. 

0.9781 

0.9637 

0.9637 

R  =  0.50,  <  15%  R.H. 

0.9772 

0.9714 

0.9719 

R  =  0.50,  >  85%  R.H. 

0.9806 

0.9620 

0.9612 

Table  5;  Range  of  Statistically-Significant  Differences  in  da/dN  vs.  AK 


AK  Range  of  Statistically-Significant  Difference 

Experiment 

No  Thickness  Corr. 

1 6%  Thickness  Corr. 
regression  derivative 

1 6%  Thickness  Corr. 
local  derivative 

R  =  0.05,  <  1 5%  R.H. 

6-8 

14-18 

14-16 

R  =  0.05,  >  85%  R.H. 

6-12 

6 

6* 

R  =  0.50,  <  1 5%  R.H. 

4-12 

— 

— 

R  =  0.50,  >  85%  R.H. 

4-8 

— 

— 

*  At  AK  =  6  ksiVin,  significance  was  found  when  Mest  was  run  with  at  failed  equal  variance  test 
(p=0.0465) 
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Figure  1:  Fatigue  Crack  Growth  Rate  Comparison  -  Nominal  Thickness 
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AK  (ksiWin.) 


Figure  2:  Fatigue  Crack  Growth  Rate  Comparison  -  Damage  Corrected  Thickness 
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DECONVOLUTION  OF 
THE  SPACE-TIME  RADAR  SPECTRUM 


Armando  B.  Barreto 
Assistant  Professor 

Department  of  Electrical  and  Computer  Engineering 
Florida  International  University 

Abstract 

Modem  processing  of  radar  data  collected  through  an  antenna  array  comprising  a  number  of  elements 
takes  advantage  of  the  structure  of  the  data  in  both  space  and  time.  In  order  to  exploit  that  spatio-temporal 
structure,  data  are  arranged  in  a  three-dimensional  “Datacube”,  where  each  of  the  dimensions  corresponds 
to  the  different  antenna  elements,  the  consecutive  radar  pulses  in  a  Coherent  Pulse  Interval  (CPI)  and  to 
different  range  gates  .  The  two-dimensional  FFT  of  data  collected  for  a  single  range  gate  (i.e.,  a  space-time 
snapshot),  reveals  the  presence  of  scatterers  of  interest,  i.e.,  targets,  in  terms  of  their  relative  velocity  with 
respect  to  the  radar  system  (Doppler  frequency)  and  the  azimuth  angle  with  respect  to  the  array  normal. 
This  Space-Time  Radar  Spectrum  may,  on  the  other  hand,  incorporate  the  representations  of  unwanted 
phenomena,  such  as  the  radar  return  due  to  scatterers  that  are  not  relevant  to  the  detection  process,  or 
“clutter”,  and  the  presence  of  jamming  signals  designed  to  obscure  the  existence  of  relevant  targets.  Due  to 
the  operational  principles  of  the  radar  system  the  manifestation  of  a  target  in  the  space-time  spectrum  will 
have  a  structure  that  depends  on  the  number  of  elements  in  the  antenna  array,  N,  and  the  number  of  pulses 
in  each  CPI,  M.  This  report  explores  the  possibility  of  using  the  advanced  knowledge  of  the  structure  of  the 
spectral  configurations  caused  by  valid  targets  to  simplify  the  presentation  of  the  velocity  /  angle 
characteristics  of  the  targets,  by  the  process  of  deconvolution.  Since  some  unwanted  components  of  the 
radar  signal,  such  as  clutter,  do  not  necessarily  have  the  same  structure  in  their  spectral  manifestation  as 
valid  targets,  this  report  also  explores  the  possibility  of  using  the  deconvolution  process  to  aid  in  the 
filtering  of  those  unwanted  components.  The  discussion  is  developed  around  examples  involving  both 
simulated  and  real  radar  data. 
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DECONVOLUTION  OF 
THE  SPACE-TIME  RADAR  SPECTRUM 

Armando  B.  Barreto 

Introduction 

The  availability  of  continuously  increasing  computational  power  has  enabled  the  enhancement  of 
state-of  -the-art  radar  systems  which  are  now  capable  of  processing  the  radar  signals  in  both  the  domains  of 
space  and  time.  The  radar  data  collected  through  a  modern  radar  system  incorporating  an  antenna  array  with 
N  elements  and  using  M  pulses  in  each  Coherent  Pulse  Interval  (CPI)  are  typically  arranged  in  a  data 
structure  known  as  the  “CPI  Datacube”  { 1 }  .  Figure  1  shows  such  arrangement.  This  figure  also  shows  a 
simplified  summary  of  the  rationale  behind  radar  processing  algorithms.  The  data  from  each  M  by  N  set  of 
samples  associated  with  a  given  range  gate,  or  “Space-Time  Snapshot”  are  processed  by  a  certain 
transformation  to  give  an  indication  of  the  likelihood  of  the  presence  of  a  valid  target.  If  the  estimate  of  the 
likelihood  of  a  target  being  present  at  a  given  range  fulfills  certain  criteria  the  radar  system  will  indicate  the 
presence  of  a  target  at  that  range. 


1 

N 

N  Elements 

l 

1  M  Pulses /CPI  M 

Figure  1.  The  CPI  Datacube  and  the  target  detection  process 

In  the  absence  of  unwanted  components  of  the  radar  signal,  the  power  spectral  density,  or  space-time 
spectrum  of  a  space-time  snapshot,  obtained  as  the  two-dimensional  FFT  transformation  of  the  space-time 
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snapshot,  would  provide  a  good  indication  of  the  presence,  relative  velocity  and  angle  of  a  valid  target  at 
the  range  under  study. 

In  a  realistic  operational  scenario,  however,  there  will  be  a  number  of  unwanted  components  in  the 
data  contained  in  the  Datacube,  which  will  also  have  an  expression  in  the  result  of  the  two-dimensional  FFT 
and  can  potentially  disrupt  the  target  detection  process.  The  predominant  types  of  those  unwanted 
components  are  returns  form  irrelevant  backgrounds,  or  clutter ,  and  jamming  signals  purposely  radiated  to 
overwhelm  and  obscure  the  returns  from  valid  targets  { 5 } . 

There  are  three  important  characteristics  of  the  process  by  which  the  presence  of  targets,  clutter  and 
jamming  express  themselves  in  the  space-time  spectrum: 

1)  The  space-time  spectral  manifestation  of  an  elementary  target  (a  target  that  exists  at  one  azimuth 
angle  and  has  one  relative  velocity  with  respect  to  the  radar  system)  has  a  predictable  format,  which 
depends  on  the  operational  specifications  of  the  radar  system:  The  number  of  antenna  elements  in  the  array, 
N,  and  the  number  of  radar  pulses  per  CPI. 

2)  The  space-time  spectral  manifestation  of  unwanted  components  in  the  radar  signal  do  not 
necessarily  follow  the  same  format  as  for  an  elementary  target. 

3)  The  expressions  for  targets,  clutter  and  jamming  in  the  pace-time  spectrum  for  a  given  range  gate 
appear  combined  in  a  additive  fashion. 

To  enhance  the  capabilities  of  the  radar  system  to  identify  valid  targets  exclusively  several  approaches 
can  be  taken.  The  current  Space-Time  Adaptive  Processing  (STAP)  techniques  aim  at  the  elimination  of  the 
interference  components  in  the  space-time  spectrum  by  the  application  of  two-dimensional  filters  to  the 
space-time  snapshot  to  place  nulls  in  the  areas  of  space-time  spectrum  where  the  interferences  express 
themselves.  Given  the  variability  of  the  interference  these  two-dimensional  filters  need  to  be  recalculated 
periodically,  in  a  data-adaptive  manner  that  requires  the  repeated  estimation  of  the  characteristics  of  the 
interference.  The  alternative  approach  to  the  identification  of  valid  targets  proposed  in  this  report  is  the 
reversion  of  the  process  that  transforms  the  information  (velocity,  azimuth  and  power)  of  a  clearly  specified 
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target  into  its  expression  in  the  space-time  spectrum.  This  process  of  deconvolution  of  spectral  expression 
of  valid  targets  will  be  beneficial  to  the  detection  process  in  that  it  will  result  in  a  sharper  and  more 
identifiable  representation  for  each  one  of  the  valid  targets  contained  in  the  space-time  snapshot. 
Furthermore,  since  the  deconvolution  process  is  based  on  the  spectral  representation  of  a  valid  target  only, 
forms  of  interference  not  conforming  to  this  model  will  not  be  represented  in  the  deconvolved  result, 
providing  some  partial  or  total  elimination  of  those  unwanted  components. 

Methodology 

The  process  of  deconvolution  of  the  space-time  spectrum  proposed  here  is  based  on  the  abstraction  of 
the  whole  process  leading  to  the  space-time  spectrum  of  a  snapshot  containing  one  or  several  targets  as  a 
linear,  shift-invariant,  two  dimensional  transformation,  HNM{t(4>,  COy)},  applied  to  an  abstract  ‘Target  Field”, 
t(<p,  o^).  This  ‘Target  Field”,  t(<p,  cou)  is  the  discrete  representation  of  the  presence  /  absence  and  power  of 
elementary  targets  at  any  given  azimuth  <j>  and  normalized  Doppler  frequency  coy,  within  the  ranges  (same 
as  ranges  covered  by  the  ordinary  space-time  spectrum): 

-1  <  sin(<(>)  <  1 
-0.5  <  ecu  <  0.5 

It  should  be  noted  that  this  ‘Target  Field”  is  an  abstract  model  synthesized  from  two  physical 
quantities:  The  relative  velocity  of  any  targets  represented  in  the  space-time  snapshot  (as  indicated  by  the 
corresponding  Doppler  frequency)  and  the  azimuth  of  the  targets.  Thus,  if  a  single  target  of  unit  power 
exists  in  the  space-time  snapshot  for  a  given  range,  at  an  azimuth  <j>],  moving  at  a  velocity  represented  by  a 
Doppler  coa,,  then  the  only  non-zero  value  in  the  whole  Target  Field  will  be  an  impulse  located  at  (<f>i,  oui), 
i.e.,  t(4>,  cou)  =  8  (fo,  aui) . 

The  proposed  linear,  shift-invariant  transformation,  HNM{t(<t>,  ecu))  models  the  complete  process 
involved  in  the  operation  of  the  multi-element  radar  system  (which  defines  the  data  contained  in  the  space- 
time  snapshot  under  analysis  from  the  abstract  Target  Field),  and  the  generation  of  the  corresponding 
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space-time  spectrum  in  terms  of  the  Power  Spectral  Density  for  any  given  Doppler  and  azimuth,  PSD((<f>, 
cob): 

PSD(<j),  ay)  -  H\m{  t(4>,  ay)  } 

Under  the  assumption  of  linearity  and  shift-invariance  of  HNM{ },  PSD(<j>,  ay)  can  be  obtained  as  the 
circular  convolution  of  the  Target  Filed  and  the  impulse  response  associated  with  HNM{ } ,  h(4>,  cati): 

PSD(4>,  ay)  =  t(4>,  ay)  ®  h(<t>,  ay) 

Figure  2  shows  this  whole  process  for  a  Target  Filed  with  a  single  elementary  target  and  the  impulse 
response  of  a  system  with  6  antenna  elements  and  6  pulses  per  CPI  (N=6,  M=6). 


Space-Time  Spectrum 

PSD(<t>,cod) 


ccy 


Impulse  response  of  HNm{} 

h(<f>,ce>d) 


Target  Field 

t(<{),COd) 


Figure  2.  Relationship  between  the  Target  Field,  The  Hnm{ }  transformation  and  the  space-time 
spectrum  under  the  convolution  model. 
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Furthermore,  in  the  context  of  discrete  Target  Fields  and  discrete  space-time  spectra,  both  of  them 
with  finite  support ,  the  circular  convolution  operation  representing  the  application  of  HNM{ }  to  t(<[>,ceb)  can 
be  put  in  terms  of  a  matrix  product  of  the  row-scanned  target  filed,  pre-multiplied  by  an  appropriate  matrix 
H[i,j]  to  result  in  a  vector  representing  the  row-scanned  time-space: 

H(k2),(K2)  t(K2xl)  =  PSD(K2jtI) 

In  this  equation  K  is  the  assumed  number  of  discrete  values  for  both  dimensions  of  the  Target  Field 
and  the  space-time  spectrum.  The  [i,j]  entries  of  the  matrices  involved  in  the  above  equation  represent  the 
following: 

t[p(N-l)  +q,l]:  the  q*  element  of  the  p*  row  of  the  original  N  by  M  target  field  matrix 

PSD[p(N-l)  +q,l]:  the  q1*1  element  of  the  p*  row  of  the  original  N  by  M  space-time  matrix 

H[i,j]:  the  contribution  to  the  PSD  in  the  Doppler,  azimuth  combination  represented  by 

the  [i,  1]  element  of  the  vector  PSD,  due  to  an  assumed  elementary  target  of  unit  power  located  at  the 
Doppler  /  azimuth  combination  represented  by  the  0.1]  element  of  the  vector  t. 

The  functional  definitions  of  the  elements  in  the  matrix  formulation  of  the  convolution  process  that 
yields  the  space-time  spectrum  from  the  Target  Field  is  particularly  useful  in  the  context  of  the  problem  at 
hand.  This  is  because  the  H  matrix  can  be  generated  by  a  numerical  process  that  starts  with  the  simulation 
of  the  space-time  spectrum  for  an  elementary  target  at  zero  Doppler  and  zero  azimuth  : 

Hnm°  =  H{8($,cq,)} 

Hnm°  can  be  obtained  as  a  K  by  K  matrix,  simulating  that  target  configuration,  without  any 
interference,  using  the  MATLAB  functions  developed  by  Y.  Seliktar,  from  the  Georgia  Institute  of 
Technology,  assuming  N  antenna  elements  and  M  pulses  per  CPI  {2}.  Then  an  augmented  matrix  can  be 
formed  by  placing  9  replicas  of  HNM°  in  a  3  by  3  arrangement  and  each  i*  row  of  the  required 
transformation  matrix  H  can  be  found  by  selecting  a  K  by  K  submatrix  from  the  augmented  matrix, 
chosen  such  that  the  origin  of  the  central  replica  of  HNM°  matches  the  position  of  the  element  represented  by 
the  [i,  1]  element  of  the  vector  t  and  then  row-scanning  it  into  a  single  row. 
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Casting  the  circular  convolution  process  in  the  form  of  a  matrix  product  has  the  specific  advantage  that 
it  provides  an  immediate  representation  for  the  deconvolution  process,  i.e.  the  process  that  regenerates  the 
Target  Filed  from  knowledge  of  the  space-time  spectrum  and  the  transformation  H{  }  that  links  them. 

An  immediate  approach  to  the  numerical  implementation  of  the  deconvolution  process  would  be, 
assuming  that  the  original  transformation  matrix  H  is  non-singular: 

H1  PSD  =  t 

This  implementation  of  deconvolution  would  also  be  very  computationally  efficient,  since  the  bulk  of 
the  computation  would  be  required  for  the  one-time  calculation  of  H'\  given  the  fact  that  H  is  only  a 
function  of  the  known  characteristics  of  the  radar  system  (N  array  elements,  M  pulses  per  CPI).  Thus,  H"1 
could  be  calculated  in  advance  and  the  prospective  real-time  implementation  of  the  deconvolution  would 
only  require  the  product  of  a  K2  by  K2  matrix,  times  a  K2  by  1  vector  (the  PSD  from  the  space-time 
snapshot).  Unfortunately,  this  is  an  unconstrained  solution  that  will  commonly  model  PSD  components  that 
should  be  attributed  to  single  positive  targets  as  caused  by  configurations  of  multiples  targets,  some  of 
them  negative,  that  can  also  explain  the  PSD  features  in  question.  Figure  3  shows  the  example  of  a  target 
field  obtained  by  H'1  deconvolution  from  a  PSD  calculated  for  a  simulated  Target  Filed  with  only  one  target 
at  zero  Doppler  and  zero  azimuth.  The  presence  of  excess  components,  some  of  them  negative  is  evident. 


Figure  3.  Example  of  results  obtained  through  ff!  Deconvolution,  a)  PSD,  b )  Target  Field 
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According  to  the  initial  formulation  of  the  abstract  Target  Field,  its  values  can  only  represent  the 
presence  (t(<|>,Cfln)  >  0  )  or  absence  (t(<j),atj)  =  0  )  of  a  target  at  the  specified  Doppler  frequency  and 
azimuth.  Therefore,  a  deconvolution  result  which  implies  a  target  field  with  negative  components  does  not 
adjust  to  the  modeling  of  the  problem.  This,  in  turn,  indicates  that  we  should  look  for  a  solution  mechanism 
in  which  the  positivity  constraint  can  be  imposed  over  the  solution.  The  initially  proposed  solution  to  the 
system 

H  t  =  PSD 

Requires  that  the  following  be  strictly  verified: 

II  H  t  -  PSD  II  =  0 

Alternatively,  the  implementation  of  the  deconvolution  of  the  space-time  spectrum  can  be  approached 
as  the  constrained  minimization  of  II  H  t  -  PSD  II,  subject  to  the  condition  that  all  the  elements  of  the 
resulting  solution  PSD  be  positive  or  zero.  Numerical  methods  for  such  type  of  constrained  minimization 
have  been  developed  for  the  solution  of  problems  in  diverse  fields.  Specifically,  Lawson  and  Hanson  {6} 
have  developed  a  solution  to  the  “Non-Negative  Least  Squares”  (NNLS)  problems  that  carries  out  this 
constrained  minimization.  Their  text  demonstrates  that  the  convergence  of  the  minimization  process  is 
guaranteed,  although  both  the  time  required  and  the  accuracy  of  the  solution  may  vary,  according  to  the 
specifics  of  the  matrix  and  vector  involved. 

Figure  4  shows  the  results  of  constrained  minimization  deconvolution  of  a  space-time  spectrum 
calculated  for  a  single  simulated  target  at  30°  azimuth  and  150  Hz  Doppler.  The  simulation  of  the  single 
target  and  the  definition  of  the  H  matrix  assumed  the  use  of  six  antenna  elements  (N  =  6)  and  six  pulses  per 
CPI  (M  =  6).  In  this  case  the  space-time  spectrum  obtained  from  the  Georgia  Tech  MATLAB  ®  routines 
had  to  be  decimated  from  256  by  256  values  to  32  by  32,  so  that  matrix  sizes  and  computational  time  were 
practical.  It  should  be  noted  that  even  after  this  reduction  in  the  accuracy  of  the  representation  for  the  target 
its  velocity  and  azimuth  are  indicated  clearly  in  the  deconvolved  Target  Field. 
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Figure  4.  Example  of  constrained  minimization  deconvolution  on  simulated  target  data, 
a)  Original  PSD,  b)Deconvolved  Target  Field 

Results 

In  this  section  of  the  report  results  of  the  deconvolution  process  as  applied  to  space-time  radar  data 
will  be  shown,  as  evidence  of  the  feasibility  of  this  new  approach  to  the  enhancement  of  signal  processing 
systems  to  retrieve  information  about  valid  targets.  Initially,  the  results  for  sets  of  synthetic  data  will  be 
shown.  These  data  were  simulated  using  the  “simit”  MATLAB  ®  utility  developed  by  Y.  Seliktar  of 
Georgia  Tech  {2}.  In  reach  case  the  synthesis  of  data  was  carried  out  under  controlled  conditions  (Target, 
clutter  and  jamming  information  known  and  M  and  N  values  pre-specified).  In  each  case,  the  space  time 
spectrum  was  also  obtained  using  a  customized  version  of  “dispit”  MATLAB  ®  utility  by  Y.  Seliktar, 
saving  the  magnitude  of  the  space-time  spectrum  as  a  256  by  256  matrix  of  real  values  .  After  decimation 
to  32  by  32  for  ease  and  speed  of  implementation,  the  spectrum  matrix  was  row-scanned  into  the  column 
vector  PSD,  used  in  the  constrained  optimization.  Next,  results  are  shown  of  the  application  of  space-time 
spectral  deconvolution  to  real  space-time  data  from  the  Common  Research  Environment  for  STAP 
Technology  (CREST)  Data  Library,  collected  under  the  Mountain  Top  Program  (7).  The  author  thanks 
and  acknowledges  the  assistance  and  access  provided  by  the  CREST  office  at  the  Maui  High-Performance 
Computer  Center. 


*  Results  from  synthetic  data: 


In  all  the  results  from  synthetic  data  the  general  settings  for  the  simulation  were  as  follows: 

•A  single  target  with  a  power  of  30  dB  was  simulated  at  -30°  Azimuth  and  150  Hz  Doppler 
•The  Doppler  Range  was  from  -300  Hz  to  300  Hz. 

•The  orientation  of  the  “clutter  ridge”  was  determined  by  a  platform  speed  of  100  m/s 
•When  a  jamming  source  was  used,  it  was  simulated  at  0°  azimuth 

•The  parameters  N  =  6  and  M  =  6  were  used.  The  spectrum  was  downsampled  to  32  by  32  (K  =  32) 


Target  and  low  clutter  and  jamming:  Figure  5  shows  two  views  of  the  space-time  spectrum  for  a 
scenario  in  which  the  power  of  both  clutter  and  jamming  were  simulated  at  10  dB. 


Figure  5.  Deconvolution  of  PSD  including  a  target  and  low-power  clutter  and  jamming,  a),  b)  3-D 
and  contour  plots  of  original  spectrum,  c)  d):  3-D  and  contour  plots  of  deconvolved  Target  Field. 


5-11 


Figure  5  also  shows  two  views  of  the  resulting  Target  Field  obtained  by  constrained  minimization.  It 
should  be  noted  that,  under  these  conditions  of  relatively  low  clutter  and  jamming,  the  deconvolution 
correctly  represents  the  velocity  and  azimuth  of  the  single  target  simulated  and  minimizes  the  representation 
of  clutter  and  jamming.  These  unwanted  components,  in  fact,  are  only  barely  modeled  in  Doppler,  azimuth 
combinations  were  their  spectra  intersect,  resembling  the  spectrum  of  a  small  valid  target. 

Target  and  strong  clutter:  Figure  6  shows  the  results  of  deconvolution  when  the  power  of  the 
simulated  clutter  was  increased  to  60  dB  and  no  jamming  was  included.  This  experiment  exemplifies  the 
capability  of  deconvolution  to  minimize  the  representation  of  clutter  in  the  Target  Filed,  given  the  fact  that 
it  typically  has  a  spectral  manifestation  with  a  structure  different  to  that  of  a  valid  target. 


sin(azimuth  « 

- 1  It  SinianmulM 


Figure  6.  Deconvolution  of  PSD  including  a  target  and  clutter,  a),  b)  3-D  and  contour  plots  of  original 
spectrum,  c)  d):  3-D  and  contour  plots  of  deconvolved  Target  Field. 
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Target  and  Jamming  and  low-power  clutter:  Figure  7  shows  the  results  of  space-time  spectrum 
deconvolution  when  clutter  with  a  power  level  of  lOdB  and  jamming  interference  with  a  power  level  of 
30dB  were  simulated  along  with  the  target.  The  overwhelming  presence  of  the  jammer  is  evident  in  both 
views  of  the  space-time  spectrum.  In  this  case,  the  spectral  expression  of  the  jamming  resembles  a 
succession  of  the  typical  spectral  patterns  of  valid  targets,  arranged  along  a  line  of  constant  azimuth  (0°). 
Accordingly,  the  deconvolution  process  modeled  the  jammer  as  a  series  of  targets  at  0°  azimuth,  obscuring 
the  retrieval  of  the  true  valid  target  in  the  simulation. 


Figure  7.  Deconvolution  of  PSD  including  a  target ,  jamming  and  low-power  clutter,  a),  b)  3-D  and 
contour  plots  of  original  spectrum,  c)  d):  3-D  and  contour  plots  of  deconvolved  Target  Field. 
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*  Results  from  real  radar  data:  In  order  to  verify  the  validity  of  the  deconvolution  method  to  retrieve 


an  accurate  representation  of  the  Doppler  and  azimuth  of  targets  sensed  through  a  real  radar  system,  some 
files  from  CREST  Data  Library  have  been  processed.  Real  data  were  also  used  to  confirm  the  observations 
regarding  the  decreased  sensitivity  of  the  deconvolution  process  to  clutter  components  of  the  space-time 
spectrum. 

Radar  Data  including  a  moving  target  simulator  and  IDPCA  clutter:  The  following  two  experiments 
show  the  results  of  constrained  minimization  deconvolution  on  space-time  spectra  of  data  collected 
through  the  Radar  Surveillance  Technology  Experimental  Radar  (RSTER)  using  the  Inverse  Displacement 
Phase  Center  Array  (IDPCA)  at  WSMR,  New  Mexico.  Although  the  Array  is  physically  at  a  fixed  location, 
the  sequencing  of  the  transmitted  pulses  in  the  IDPCA  produces  clutter  returns  equivalent  to  those  sensed 
by  a  moving  antenna  array.  The  Internal  Test  Target  Generator  in  the  RSTER  was  used  to  generate  a  target 
signal  in  the  record. 


I38pre0tv  l.CP'6  -  lOPCA  clutter  *  ASTER  internal  target  generator 


Deconvolved.  From  i38pe<i0ivl.CPI6  -  processed  wi  KemUitj 


Figure  8.  Deconvolution  of  file  t38pre01v.cpi6.mat.  a),  b)  Original  spectrum,  c)  d):  Target  Field. 
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Doppler 


Figure  8  shows  the  original  space-time  spectrum  and  the  deconvolved  Target  Field  for  data  file 
t38pre01vl.cpi6.mat .  Here  again,  the  original  256  by  256  space-time  spectrum  had  to  be  downsampled  to 
32  by  32  to  reduce  the  computational  load  of  the  solution.  The  spectral  manifestation  of  the  target  is 
discernible  in  the  original  space-time  spectrum,  but  it  is  much  more  clear  and  localizable  in  the  deconvolved 
Target  Field.  There  is  virtually  no  evidence  of  a  “clutter  ridge”  in  the  deconvolved  Target  Field.  Figure  9 
shows  the  results  for  a  similar  data  file:  t38-04vl.cpil.mat.  Here  the  deconvolved  Target  Field  is  almost 
exclusively  defined  by  a  non-zero  target  element,  corresponding  to  the  identifiable  Doppler  and  azimuth  of 
the  discernible  target  in  the  original  space-time  spectrum,  with  no  representation  of  clutter. 


Figure  9.  Deconvolution  of  file  t38-04vl.cpil.mat.  a),  b)  Original  spectrum,  c)  d):  Target  Field. 
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Doppior 


The  final  example  of  deconvolution  results  from  real  data,  shown  in  Figure  10,  illustrates  the  level  of 
resistance  of  the  deconvolution  process  to  represent  unwanted  spectral  components  that  do  not  conform  to 
the  pattern  of  valid  targets.  The  file,  mmit003vl.cpil.mat,  does  not  contain  targets.  It  only  contains  “Hot 
clutter”  and  light  effects  of  a  jammer,  which  have  already  been  mitigated  in  the  original  space-time 
spectrum.  The  results  of  constrained  minimization  deconvolution  show  a  Target  Field  comprised  of  several 
disorganized  and  very  low-power  components  (the  heights  of  the  elements  in  the  3-D  Target  Field  plot  are 
much  smaller  than  those  of  targets  in  Figures  8  and  9.) 
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Discussion 


The  results  of  constrained  minimization  deconvolution  shown  in  the  previous  section  confirm  the 
potential  of  this  technique  to: 

a)  Accurately  model  the  presence  of  valid  targets  in  the  Target  Field,  clearly  defining  their  azimuth 
and  their  velocity,  as  represented  by  their  Doppler  frequency. 

b)  Provide  only  a  limited  representation  of  space-time  spectrum  components  that  do  not  conform  to 
the  spectral  patterns  of  valid  targets,  such  as  clutter. 

It  should  be  noted  that  the  above  experiments  were  carried  out  on  space-time  spectra  downsampled 
from  their  original  256  by  256  size  to  a  reduced  size  of  32  by  32.  This  was  done  to  bring  the  amounts  of 
storage  and  processing  cycles  involved  to  levels  manageable  by  the  system  used  for  this  project.  The  impact 
of  this  simplification  was  less  pronounced  in  the  synthetic  data  experiments,  where  the  number  of  antenna 
elements,  N,  and  the  number  of  pulses  per  CPI,  M,  were  both  set  at  a  value  of  six.  It  is  interesting  to  note 
that,  for  the  purpose  of  deconvolution,  the  wider  sidelobes  in  Doppler  and  azimuth  that  result  from  these 
low  values  of  N  and  M  actually  contribute  to  a  good  definition  of  the  target  spectral  pattern  in  a 
downsampled  spectrum  (K  =  32).  In  contrast,  spectra  of  real  space-time  snapshots  obtained  from  the 
CREST  Data  Library,  recorded  using  N  =  14  and  M=16,  experienced  significant  loss  of  detail  through 
downsampling,  relative  to  the  size  of  the  lobes  of  the  target  spectral  pattern.  These  limitations  in  both  the 
spectra  under  analysis  and  the  impulse  response  used  to  form  the  H  matrix  required  for  the  deconvolution 
influenced  the  appearance  of  spurious  non-zero  values  around  the  main  components  of  the  recovered  Target 
Fields. 

The  capability  of  the  constrained  minimization  deconvolution  to  partially  avoid  the  representation  of 
clutter  was  evidenced  with  both  synthetic  and  real  (IDPCA)  clutter.  This  feature  of  the  process  is  attributed 
to  the  fact  that  the  spectral  manifestation  of  clutter  is,  in  general,  different  from  that  of  a  valid  target,  which 
is  the  one  model  that  is  being  used  for  the  reconstruction  of  the  Target  Field.  On  the  other  hand, 
experiments  involving  synthetic  jamming  showed  that  the  characteristic  spectral  pattern  of  barrage 
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jamming,  spanning  all  frequencies  along  a  single  azimuth  line,  is  interpreted  by  the  deconvolution  process 
as  the  superposition  of  a  series  of  targets  with  different  velocities  but  lined  up  in  a  single  angle. 

While  the  deconvolution  process  does  not  appear  to  be  able  to  discard  jamming  interference  in  the 
reconstruction  of  the  Target  Field,  its  ability  to  accentuate  the  presence  of  valid  targets  and  significantly 
reduce  the  representation  of  clutter  makes  it  suitable  for  its  integration  to  partial  adaptive  algorithms  that 
excel  in  removing  uni-directional  jamming  interference  {8}.  Specifically,  spectral  deconvolution  could  be 
proposed  as  post-processing  to  follow  standard  beamforming  techniques  (adaptation  in  space  only). 

In  considering  the  merits  and  potential  of  this  new  form  of  space-time  radar  data,  it  must  be  kept  in 
mind  that  this  approach  attempts  to  make  an  efficient  use  of  the  combined  features  in  space  and  time  of 
data  samples  captured  in  a  single  space-time  snapshot.  This  is  in  sharp  contrast  with  most  fully-  or 
partially-adaptive  space-time  processing  methods  for  airborne  radar  {1},  {3},  {4}.  In  most  of  those 
approaches  the  required  knowledge  about  the  interference  and  the  target  is  made  accessible  by  resorting  to 
the  processing  of  augmented  sets  of  data.  For  example,  the  estimation  of  the  interference  covariance  matrix 
Ru,  used  for  the  definition  of  the  STAP  weight  vector  that  is  appropriate  for  a  single  space-time  snapshot 
requires  the  processing  of  a  number  of  snapshots  “surrounding”  the  snapshot  under  study.  This  “Sample 
Support”  around  the  snapshot  of  interest  may  involve  2MN  to  5MN  auxiliary  snapshots,  each  containing 
MN  space-time  samples,  under  the  assumption  that  the  interference  is  effectively  stationary.  So,  calculation 
of  a  single  weight  vector,  optimized  for  a  specific  combination  of  range,  expected  target  Doppler  frequency 
and  expected  azimuth  may  require  manipulation  of  a  matrix  Ru  of  size  (MN)3  by  (MN)3,  in  the  fully 
adaptive  case  {!}.  Additionally,  in  the  absence  of  a  priori  knowledge  about  the  expected  velocity  and 
azimuth  of  the  target  or  targets,  a  number  of  different  weight  vectors  need  to  be  calculated  and  applied  to 
each  space-time  snapshot  under  study,  to  explore  the  possibility  of  finding  a  target  in  each  Doppler  / 
azimuth  cell  of  interest  within  the  whole  space-time  spectrum. 

In  the  context  described  above,  it  may  be  suggested  that  this  new  approach  to  the  detection  of  relevant 
targets  in  space-time  radar  data  may  be  used,  after  standard  adaptive  beamforming  to  minimize  the  effects 
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of  barrage  jamming,  to  indicate  the  Doppier  /  azimuth  combinations  that  may  contain  a  target,  if  any. 
Since  the  process  of  deconvolution  may  be  carried  simultaneously  with  any  of  the  other  fully-  or  partially- 
adaptive  STAP  methods,  this  form  of  preselection  of  Doppler  /  Azimuth  ceils  to  study  could  expedite  the 
determination  of  presence  or  absence  of  a  target  in  a  given  space-time  snapshot. 

Conclusions 

This  report  has  presented  an  exploration  into  the  proposition  of  an  alternative  approach  to  the 
definition  of  targets  from  space-time  radar  data.  Most  of  the  current  space-time  adaptive  processing 
methods  concentrate  in  the  development  of  multidimensional  filters  aimed  at  establishing  nulls  in  the 
areas  of  the  space-time  spectrum  where  the  interferences  lie.  In  order  to  achieve  this  goal  extensive 
knowledge  about  the  interference  type  is  needed,  which  is  ordinarily  estimated  from  additional  data.  In 
contrast,  the  proposed  deconvolution  approach  is  based  on  the  knowledge  available  about  the  space-time 
spectral  manifestation  of  a  valid  target,  minimizing  the  representation  of  some  forms  of  interference,  such 
as  clutter,  precisely  because  their  spectral  manifestation  does  not  conform  to  the  model  sought  (the  spectral 
pattern  associated  with  a  valid  target).  The  shift  of  the  focus  in  this  alternative  approach,  from  finding  out 
and  de-emphasizing  the  interference,  to  finding  out  and  emphasizing  the  target  prompts  to  the  potential 
application  of  space-time  spectral  deconvolution  as  a  parallel  method  to  other  STAP  algorithms,  within  a 
comprehensive  detection  system. 

In  this  project  an  implementation  for  the  deconvolution  of  discrete  space-time  radar  spectra  was 
proposed.  This  implementation  is  developed  as  the  constrained  minimization  of  the  norm  of  the  difference 
between  the  two  sides  of  the  equality  relating  the  abstract  ‘Target  Field”  and  the  space-time  spectrum  of 
radar  data  contained  in  a  single  snapshot.  The  proposed  implementation  was  used  to  prove  the  feasibility 
and  potential  of  the  deconvolution  concept  with  both  synthetic  and  real  data.  In  both  cases  the  method 
showed  the  ability  to  properly  represent  valid  targets  while  significantly  de-emphasizing  clutter.  On  the 
other  hand,  the  method  was  only  able  to  de-emphasize  the  representation  of  jamming  interference  of  low 
power.  Accordingly,  it  was  suggested  that  for  a  realistic  scenario  an  adaptive  beamforming  pre-processing 
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block  aimed  at  canceling  jamming  would  greatly  benefit  the  performance  of  the  deconvolution  approach.  It 
is  important  to  note,  however,  that  while  jamming  interference  may  be  very  powerful  and  overwhelm  a 
target  detection  system,  it  is  still  a  “one-dimensional”  interference  (single  azimuth).  On  the  other  hand, 
space-time  deconvolution  appears  to  be  useful  against  some  forms  of  interference  that  are  truly  two- 
dimensional  in  nature,  such  as  the  clutter  ridges  that  appear  in  spectral  configurations  (e.g.  diagonals)  that 
effectively  span  a  full  range  of  Doppler  frequencies  and  azimuth  values. 
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Abstract 


The  usefulness  of  wavelets  in  digital  and  signal  processing  has  been  widely  accepted  and  appreciated  for  the 
past  several  years.  Also,  the  need  for  data  compression  as  well  as  the  ability  to  train  networks  via  radial  basis 
functions  has  been  known  for  the  past  decade.  This  research  was  conducted  for  the  purpose  to  ascertain  the  benefits 
of  using  wavelets  for  data  compression  followed  by  the  use  of  a  radial  basis  function  to  determine  if  data  fusion  in  the 
context  of  infared  and  passive  millimeter  wave  imaging  can  benefit  the  identificatiion  of  potential  targets  as  well  as 
discriminate  between  a  non-target.  The  study  showed  that  data  compression  can  be  attained  while  retaining  a  high 
percentage  of  image  characteristics  of  a  target.  Also,  it  is  conjectured  that  data  fusion  most  likely  enhances  target 
identification.  However,  further  study  needs  to  be  udertaken  to  determine  under  what  conditions  data  fusion  is  most 
successful.  Finally,  further  investigation  needs  to  be  done  to  determine  how  well  a  non-target  can  be  discriminated 
from  a  non-target  using  neural  networks. 
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THE  USE  OF  WAVELETS  AND  NEURAL  NETWORKS  IN  DATA  COMPRESSION  AND  DATA  FUSION 
WITH  RESPECT  TO  TARGET  IDENTIFICATION 

LARRY  A.  BEARDSLEY 


Introduction 

In  the  recent  past,  the  need  for  data  compression  has  been  recognized  in  many  different  applications.  One 
of  the  most  well-known  uses  of  data  compression  is  in  its  value  to  fingerprinting.  As  of  June,  1994,  the  FBI  had  about 
30  million  sets  of  fingerprints  [  5].  This  number  has  substantially  increased.  These  need  to  be  digitized.  For  example, 
when  a  thief  is  apprehended,  fingerprints  are  recorded  electronically.  A  central  computer  looks  for  a  match.  Until  the 
early  1990’s,  this  process  was  time  consuming  and  this  would  be  costly  to  the  chances  of  apprehending  of  the  criminal 
in  time  that  could  be  valuable.  In  the  beginning,  it  was  expected  that  older  Fourier  methods  would  succeed  here. 
However,  with  a  compression  ratio  of  20:1 ,  the  fingerprint  ridges  could  not  be  easily  followed  due  to  the  fact  that  lines 
were  broken  between  one  eight-by-eight  square  and  the  next.  Also,  the  short-timed  Fourier  transform  introduced  too 
much  blocking.  With  the  used  of  wavelets,  the  image  can  be  compressed  at  the  given  ratio  of  20:1  with  little  loss  of 
information. 

With  the  above  applications  and  even  earlier  ones,  it  was  conjectured  by  prominent  individuals  in  academia, 
industry,  and  the  defense  agencies,  that  there  may  be  widespread  advantages  to  military  applications.  Since  time  is  of 
the  essence  when  imaging  a  military  target  with  either  passive  millimeter  wave  or  infared  or  both,  the  less  the  amount 
of  data  to  be  processed  to  obtain  virtually  the  same  target,  the  better.  Since  the  advent  of  the  Co'rflet  and  Daubechie 
wavelet,  great  work  has  been  accomplished  in  many  applications,  and  data  compression  is  one  of  these. 

For  the  past  year,  the  effect  of  using  wavelets  in  military  applications  has  been  studied.  This  study  was 
undertaken  to  determine  if  wavelets  were  beneficial  in  target  identification  with  less  data.  Also,  later  in  the  research 
period,  the  study  of  neural  nets  and  radial  basis  functions  was  underaken  to  see  how  it  benefited  in  target  recognition 
using  the  two  methods  of  imaging,  passive  milliter  wave  and  infared.  No  strong  results  are  presented  in  this  paper  with 
respect  to  this  last  venture  due  to  time  constraints.  However,  the  value  of  the  use  of  neural  nets  to  train  for  target  and 
non-target  is  not  to  be  minimized.  A  detailed  summary  report  of  results  of  compression  of  images  and  the  effects  of 
data  fusion  on  the  given  images  using  PCNN  is  to  be  presented  to  the  laboratory  along  with  this  final  report.  Due  to  the 
classification  of  the  images  and  the  number  of  them,  they  are  not  included  in  this  report.  Also,  more  detailed 
suggestions  are  included  in  that  follow-up  report. 

Methodology 

Several  military  targets  were  provided  by  the  laboratory.  These  targets  were  in  the  form  of  what  is  commonly 
called  “raw  format”  or  “tif  format" .  The  “tif’  formatted  files  could  be  opened  using  microsoft  word.  However,  the  wavelet 
analysis  could  not  be  performed  in  this  format.  Yet,  the  raw  format  could  be  manipulated  using  matlab  and  a  wavelet 
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toolbox  to  perform  both  imaging  and  data  compression.  Also,  results  could  be  derived  using  matlab  programming  to 
determine  the  percentage  of  the  image  retained  as  well  as  generating  histograms  of  the  wavelet  coifficients  which  in 
turn  gave  useful  information.  Also,  programming  was  done  to  decompress  the  data.  The  results  of  data  compression 
were  very  positive  yielding  high  compression  ratios  while  retaining  the  “energy”  of  the  given  picture;  that  is,  the  image 
was  virtually  the  same  after  data  compression. 

The  last  aspect  of  the  study  took  place  with  only  a  few  weeks  of  the  researcher’s  tenure  at  the  laboratory. 
So,  conclusive  results  were  not  found.  However,  using  a  method  called  PCCN,  “pulse-coupled  neural  networks”,  to 
segment  the  data,  it  is  conjectured  that  PCCN  can  be  used  in  the  fusion  of  the  two  methods  of  imaging  yielding 
enhanced  results  in  particular  cases.  Under  what  conditions,  fusion  is  favorable  needs  to  be  studied  as  well  as  the 
accuracy  that  target  and  non-target  can  be  discriminated. 

The  Experiment 

Original  infared  and  mmw  images  were  compressed  using  both  the  Daubechie2  and  Daubechie6  wavelet 
to  compress  the  entire  images.  Then,  PCNN  was  run  on  the  entire  image.  Outlines  of  the  regions  of  interest  (ROI)  of 
possible  targets  were  generated.  Then  a  wavelet  histogram  routine  was  used  to  find  the  50  (or  25,  or  other  depending 
on  what  was  called  for)  on  the  largest  coefficients.  A  window  was  created  around  the  region  of  interest.  Then,  an  RBF 
algorithm  “was  to  be  used"  to  train  a  radial  basis  function  using  the  given  coefficients  or  the  the  wavelet  histograms. 
The  next  step  was  to  do  this  for  all  N-1  histograms  for  training.  Then,  results  were  to  be  kept  track  of  based  on 
probably  of  hit  P(Hit)  and  P(miss). 
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THE  USE  OF  WAVELETS  AND  NEURAL  NETWORKS  IN  DATA  COMPRESSION  AND  DATA  FUSION  WITH 

RESPECT  TO  TARGET  IDENTIFICATION 

The  appearance  of  wavelets  in  mathematics  is  not  so  recent  in  mathematics  but  it  is  fairly  recent  in  its 
application  to  engineering,  in  particular  to  digital  signal  processing.  The  word  wavelet  or  “ondelette"  was  first  known  to 
have  been  used  by  the  French  researchers,  Morlet  [1}and  Grossman  [  2  ],  [3],  in  the  early  1980’s.  However,  the 
existence  of  wavelet-like  functions  has  been  known  since  the  early  part  of  the  20th  century,  the  first  examples  being 
the  Haar  wavelet  and  the  Littlewood-Payley  wavelet.  Many  of  the  ideas  formulated  using  wavelets  have  their  origins  in 
work  in  subband  coding  in  engineering,  coherent  states  and  renormalization  group  theory  in  physics  and  the  study  of 
Calderon-Zygmund  operators  in  mathematics.  However,  it  was  not  until  recently  that  the  unifying  concepts  necessary 
for  a  general  understanding  of  wavelets  were  provided  by  researchers  such  as  [5]  Stromberg,  Meyer,  Mallat, 
Daubechies,  and  Lemarie.  Since  that  time,  the  growth  of  applications  of  wavelets  has  been  colossal  with  numerous 
authors  contributing  significantly  to  the  research. 

To  give  the  reader  an  appreciation  for  how  wavelets  play  a  role  in  data  compression,  it  seems  appropriate 
that  a  short  introduction  to  wavelets  be  given  in  this  paper. 

The  wavelet  transform  is  a  tool  that  cuts  up  data,  functions  or  operators  into  different  frequency  components, 
and  studies  each  component  with  a  resolution  matched  to  its  scale.  For  example,  in  signal  analysis,  the  wavelet 
transform  yields  a  time  history  in  terms  of  its  frequency  components.  In  this  one  respect,  the  wavelet  transofrm 
performs  in  much  the  same  way  as  the  Fourier  transform.  The  Fourier  transform  extracts  from  the  signal  details  of  the 
frequency  content,  but  loses  all  information  on  the  location  of  a  particular  frequency  within  the  signal.  A  partial  solution 
is  attempted  by  windowing  the  signal  and  then  taking  its  Fourier  transform.  The  problem  with  windowing  is  that  the 
slice  of  the  signal  that  is  extracted  is  always  the  same  length.  As  a  result,  the  slice  or  number  of  data  points  used  to 
resolve  a  high-frequency  component  is  the  same  as  the  number  used  to  resolve  a  low-frequency  component.  The 
wavelet  transform’s  solution  to  this  is  that  it  adapts  the  width  of  its  time  slice  according  to  the  frequency  components 
being  extracted.  For  example,  for  a  windowed  Fourier  transform  expressed  as 

(Tf)(w,t)=  J/(5)g(5 -Oe^ds, 

then  the  corresponding  wavelet  transform  is 

(TBvf)(a,b)=  f/(Oq>(— )dt 

J  y  a  ) 

where  the  position  of  the  slice  of  the  signal  that  the  wavelet  samples  in  time  is  controlled  by  V  and  the  extent  of  the 
slice  by  the  parameter  ’a”.  Wavelets  are  being  used  in  signal  processing  to  develop  best  basis  methods  for  signal 

compression  and  signal  characterization. 
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THE  CONSTRUCTION  OF  WAVELETS  AND  THE  RELATION  TO  DATA  COMPRESSION 


The  construction  of  wavelets  begins  with  vectors.  We  begin  with  the  concept  that  every  two-dimensional 
vector  (x,y)  is  a  combination  of  the  basis  vectors  (1,0)  and  (0,1).  For  example,  the  vector  (4,3)  can  be  obtained  by 
multiplying  the  first  basis  vector,  (1,0)  by  4  and  the  second  basis  vector,  (0,1),  by  3.  In  a  like  manner,  ail  possible 
vectors  can  be  obtained  by  using  the  same  two  basis  vectors.  The  basis  vectors  are  not  unique,  but  the  number  of 
basis  vectors  for  a  given  dimension,  “n”,  must  be  also  be  “n”.  The  best  bases  have  another  valuable  property, 
orthogonality.  For  example,  the  standard  basis  for  R2  includes  the  vectors  (0,1)  and  (1,0) ,  and  of  course  their  dot 
product  is  0. 

Considering  four  dimensions  where  we  take  the  four  components  (x,  ,  x2 ,  x3 ,  x4  )  as  the  strengths  of  the 
signal,  if  the  signal  is  x  =  (1 ,1 ,1 ,1 ),  the  tone  is  steady  whereas  if  x  =  (1 ,2,4,8),  the  volume  is  increasing.  An  easy  way 
to  send  a  signal  x  is  to  give  its  four  components,  You  are  then  using  the  basis  for  R4  with  basis  vectors  (1 ,0,0,0), 
(0,1, 0,0),  (0,0, 1,0),  and  (0,0, 0,1) ,  and  the  numbers  x, ,  x2 ,  x,  and  x4  are  sent.  You  may  question  how  this  could  be 
less  than  optimal.  The  answer  depends  on  the  signal  being  transmitted. 

Compression  discards  basis  vectors  that  are  absent  or  barely  present  in  a  signal.  When  the  vectors 
represent  different  frequencies,  the  Fourier  basis,  high  frequencies  can  often  be  discarded.  With  the  standard  basis, 
the  vector  (0,0, 0,1)  can  be  discarded,  for  example  when  the  fourth  vector  is  not  used.  It  was  novel  in  the  early  1900’s 
to  devise  a  basis  that  included  the  constant,  or  scaling  vector,  (1, 1,1,1)  and  the  second  basis  vector  (1,1, -1,-1),  up 
once  and  down  once.  Alfred  Haar,  the  German  mathematician,  completed  a  perpendicular  baisis  in  1910  by  dilation 
and  translation  (ie.  squeezing  and  shifting).  He  squeezed  the  vector  (1,1, -1,-1)  to  produce  the  third  vector  (1,-1 ,0,0). 
Then,  he  shifted  the  third  vector  by  two  time  intervals  to  produce  the  fourth  vector  (0,0, 1,-1).  These  basis  vectors  are 
the  columns  of  the  Haar  matrix. 

'1  1  10" 

11-10 
Hj=  1-10  1 

1  -1  0  1 

To  find  the  amplitudes  for  longer  signals,  averages  and  differences  are  used.  The  averages  move  up  in  a 
pyramid  to  be  averaged  again.  The  differences  give  the  fine  details  at  each  level.  Compression  works  by  ignoring 
what  the  eye  cannot  see  and  the  ear  cannot  hear.  So,  for  example,  when  a  signal  changes  quickly,  it  is  considered 
unimportant;  when  a  signal  is  almost  steady,  the  signal  may  not  be  transmitted.  The  concept  being  developed  here  is 
that  of  what  is  called  multiresolution.  We  see  a  signal  at  multiple  scales,  fine,  medium,  and  coarse.  The  signals  are 
resolved  at  each  scale  by  details  and  averages. 
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HOW  MULTIRESOLUTIONAL  ANALYSIS  WORKS? 

Before  undertaking  how  multiresolutional  analysis  works,  it  may  be  helpful  to  first  look  at  how  to  develop  a 
multilevel  representation  of  a  function.  The  goal  of  multiresolution  analysis  is  to  develop  representations  of  a  function 
f(x)  at  various  levels  of  resolution.  We  achieve  this  by  expanding  the  given  function  in  terms  of  basis  functions  <t>(x) 
which  can  be  scaled  to  give  multiple  resolutions  of  the  original  function.  In  order  to  develop  a  multilevel  representation 
of  a  function  in  L2(R)  we  seek  a  sequence  of  embedded  subspaces  V(  such  that 

{0}...cV,cV1cV,cVr..cLJ(R) 


with  the  following  properties: 

(a)  |J  j  eZ  V  |  is  dense  in  La(R).  (where  Z  is  the  set  of  integers.) 

(b)  p|;eZVj={0} 

(c)  The  embedded  subspaces  are  related  by  a  scaling  law 

g(x)eVi  <=>  g(2x)  e  V  M 

(d)  Each  subspace  is  spanned  by  integer  translates  of  a  single  function  g(x)  such  that 

g(x)  e  V0  <=>  g(x  + 1 )  e  V# 

We  also  seek  a  scaling  function  <t>(x)  e  V0  such  that  its  integer  translatges  { <|>(x  -  k) ,  k  e  Z}  from  a  Riesz 
basis  for  the  space  V0 .  Then  4>(2x  -  k)  form  a  basis  for  V,  (Scaling  by  powers  other  than  two  is  possible  but  is  not 
discussed  here.)  Therefore, 

V0  =  span{  4>(x  -  k) ,  k  e  Z} 

V,  =  spanj  4>{2x  -  k) ,  k  e  Z) 

Since  the  space  V0  lies  within  the  space  V, ,  we  can  express  any  function  in  V0  in  terms  of  the  basis  functions 

k= co 

of  V, .  In  particular,  4>(x )  =  £  ak  0(2*  -  k)  in  which  a  k,  k€  Z  is  a  square  summable  sequence.  (*) 

it=-00 

We  will  refer  to  equation  (*)  as  the  dilation  equation.  The  dilation  equation  is  often  referred  to  as  the  refinement 
equation  or  the  scaling  relation. 

If  we  now  define  4>  rok(x)  =  2m/2  <t»(2"x-  k) ,  then  $  mk(x),  keZ  forms  a  Riesz  basis  for  the  space  Vm 
The  dilation  parameter  m  will  bae  called  the  scale. 

An  example  using  the  box  function  as  a  basis  (commonly  know  as  the  Haar  basis). 
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This  example  will  be  used  to  demonstrate  the  validity  of  our  assumptions.  Refer  to  figure  1  (a). 


Figure  1.  (a)  The  Haar  seating  function  £(x)  m<p0  0.(b)  Translations  of  the  Haar  scaling  function  <*0. ,  and 

(c)  Dilations  of  the  Haar  scaling  function  t,a  aad  0.la r  (d)  The  Haar  wavelet  *(x)  b  Ko 


Figure  Z  Successive  approximations  to  a  function,  (a) 
(cl  Detail  at  scale  4,  CLf.  <d) 


fm  $m(2nx)sra(4jtx).  (b)  Approximation  at  scale  4.  PJ 
Approximation  at  scale  5,  Paf 


Let  <t>(x)  = 


0<x<l 

otherwise 


From  the  figure,  it  is  evident  that  {<}>(x  -  k) ,  k  e  Z}  is  an  orthonormal  set.  We  can  now  approximate  a  function  f  e  L2(R) 
by  its  projection  P0  f  onto  the  space  V0 : 


k= co 

P0f=  ]Tco.* 

lc=-co 


In  general,  a  function  may  be  approximated  by  its  projection  Prof  onto  the  space  Vm: 
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k-  co 

Pmf=P0f=  Om.t(x) 

k=- oo 

In  fact  Pmf  approaches  f  as  m  ->  oo 

Figures  2  on  p,  7  and  Figure  3  below  show  two  approximations  to  a  function  at  consecutive  scales.  Note  that  the  box 
function  satisfies  the  dilation  equation  with  coefficients  a0  =  a ,  =  1,  and  <(>(x)  =  <j>(2x)  +  <f>(2x  - 1) ,  and  from  this  we 
can  see  that 
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Figure  3.  (a)  Hair  scaling  function,  (b)  Hur  wavelet,  (c)  Daubechies  D4  scaling  function,  (d)  Daubechies  D4  wavelet, 

(e)  Daubechies  D6  scaling  function,  (f)  Daubechies  D6  wavelet,  (g)  Battfe-Lemartt  scaling  function,  (h)  Battle- Lonaric 

wavelet 

V^V,  czV2... 

This  example  shows  that  the  box  function  can  be  used  to  develop  a  sequence  of  embedded  spaces  with  each  space 
spanned  by  the  translates  of  the  box  function  at  that  scale. 

Now  ,  multiresolution  decomposition  takes  the  expansion  coefficients  cmk  of  an  approximation  Pm  f,  to  a 
function  at  scale  m  and  decomposes  them  into 

(1 )  the  expansion  coefficients,  c^  of  the  approximation  P^,  f  at  the  next  coarser  scale  m  -1 , 
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(2)  the  expansion  coefficients,  dm_,  k  of  the  detail  component  Qm1f  =  Pmf  -  P^f . 


For  details  of  the  algorithm  for  this  decomposition  refer  to  Mallat  [  6  ].  Basically,  the  above  process  is  repeated  with 
respect  to  the  expansion  coefficients  cm.,  k  of  the  approximation  Pm.,f  to  find  the  coefficients  c^  and  dm.ak ,  and  so  on. 
The  effect  of  multiresolution  decomposition  is  therefore  the  breaking  down  of  the  L2  space  into  a  series  of  orthogonal 
subspaces  at  different  resolutions. 

WHAT  DO  WAVELETS  HAVE  TO  DO  WITH  DATA  COMPRESSION? 

The  discrete  wavelet  transform  (DWT)  and  the  fast  Fourier  transform  (FFT)  have  similitaries  that  have 
practical  application.  The  DWT  and  the  FFT  are  both  linear  operations  that  generate  a  data  structure  containing  log2n 
segments  of  various  lengths,  usually  filling  and  transforming  it  into  a  different  data  vector  of  length  2" .  The 
mathematical  properties  of  the  two  transforms  are  also  similar.  For  example,  the  inverse  transform  matrix  for  both  the 
FFT  and  the  DWT  is  the  transpose  of  the  original.  As  a  result,  both  transforms  can  be  viewed  as  a  rotation  in  function 
space  to  a  different  domain.  Concerning  the  FFT,  the  new  domain  contains  basis  functions  that  are  sines  and  cosines, 
whereas  for  the  wavelet  transform,  the  new  domain  contains  more  complicated  basis  functions  called  wavelets,  mother 
wavelets,  or  analyzing  wavelets.  Another  similarity  that  both  transforms  have  is  that  the  basis  functions  are  localized  in 
frequency,  making  mathematical  tools  such  as  power  spectra  and  scalegrams  useful  at  picking  out  frequencies  and 
calculating  power  distributions. 

It  is  the  most  interesting  dissimilarity  between  the  wavelet  transform  and  the  fast  Fourier  transform  that  lends 
itself  to  usefulness  in  data  compression,  feature  detection  in  images,  and  noise  removal  from  time  series.  Individual 
wavelet  functions  are  localized  in  space;  fourier  sine  and  cosine  functions  are  not .  This  localization  feature  and  the 
wavelets’  localization  of  frequency  makes  many  functions  and  operators  using  wavelets  “sparse”  when  transformed 
into  the  wavelet  domain.  It  is  this  sparseness,  in  turn,  which  makes  wavelets  viable  for  the  aforementioned 
applications. 

An  advantage  of  wavelet  transforms  is  that  the  windows  vary.  One  would  like  to  have  some  very  short  basis 
functions  in  order  to  isolate  signal  discontinuities.  Also,  at  the  same  time,  in  order  to  obtain  detailed  frequency 
analysis,  one  would  like  to  have  some  very  long  basis  functions.  A  way  to  achieve  this  is  to  have  short  high-frequency 
basis  functions  and  long  low-frequency  basis  functions.  This  is  exactly  the  happy  medium  that  wavelet  transforms 
affords  us.  The  main  point  here  is  that  wavelet  transforms  do  not  have  a  single  set  of  basis  functions  like  the  Fourier 
transform,  consisting  of  sine  and  cosine  functions,  but  rather  an  infinite  set  of  possible  basis  functions.  Therefore, 
wavelet  analysis  offers  us  immediate  access  to  information  that  can  be  obscured  by  other  time-frequency  methods 
such  as  Fourier  analysis. 

In  the  next  section,  a  brief  review  of  the  mathematics  behind  wavelets  will  be  given,  but  I  deemed  it 
appropriate  to  offer  a  brief  description  of  what  wavelets  look  like  and  their  characteristics.  Wavelet  transforms 
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comprise  an  infinite  set,  and  different  wavelet  families  yield  different  trade-offs  between  how  compactly  the  basis 
functions  are  localized  in  space  and  how  smooth  they  are.  Many  of  the  wavelet  bases  have  fractal  structure  of  which 
the  Daubechies  wavelet  family  is  an  example.  Within  each  family  of  wavelets  are  wavelet  subclasses  distinguished  by 
the  number  of  coefficients  and  by  the  level  of  iteration.  This  is  an  extra  set  of  mathematical  relationships  which  the 
coefficients  must  satisfy  and  is  directly  related  to  the  number  of  coefficients.  [  7  ] 

The  key  to  the  wavelet  analysis  procedure  is  to  adopt  a  wavelet  prototype  function,  called  a  mother  wavelet. 
Temporal  analysis  is  performed  with  a  contracted,  high-frequency  version  of  the  protype  wavelet,  while  frequency 
analysis  is  performed  with  a  dilated,  low-frequency  version  of  the  same  wavelet.  Since  the  original  signal  or  function 
can  be  represented  in  terms  of  a  wavelet  expansion  (using  coefficients  in  a  linear  combination  of  the  wavelet  functions, 
data  operations  can  be  performed  using  just  the  corresponding  wavelet  coefficients.  If  you  desire  further  to  choose  the 
wavelets  best  adapted  to  your  data  or  truncate  the  coefficients  below  a  threshold,  your  data  are  sparsely  represented. 
It  is  this  sparse  coding  that  makes  wavelets  an  excellent  tool  in  the  field  of  data  compression. 

HOW  ARE  FILTER  COEFFICIENTS  DERIVED  FROM  GENERALIZED  SCALING  FUNCTIONS  AND  WAVELETS? 

From  a  heuristic  standpoint  to  aid  in  a  pedagogical  environment,  the  Haar  scaling  function  and  Haar  wavelet 
previously  described  are  very  useful.  However,  from  an  applied  standpoint,  particularly  to  digital  signal  processing,  the 
Haar  wavelet  is  not  very  useful.  Their  graphs  are  made  from  flat  pieces  consisting  of  1’s,  0’s,  and  -I’s,  and 
approximation  to  most  signals  is  poor  at  best.  Many  flat  pieces  are  needed  to  represent  even  a  sloping  line  to  decent 
accuracy.  The  Haar  basis  will  not  give  compression  ratios  desired  such  as  20:1  minimally  or  100:1.  The  key  to  the 
problem  is  a  better  basis. 

Actually,  the  Haar  basis  and  scaling  function  described  earlier  are  special  cases  of  a  more  general  class  of 
functions.  In  general,  a  scaling  function,  O  (x),  is  the  solution  to  a  dilation  equation  of  the  form 


jt=°o 

&(x)=  £ak  9(Sx-k)  0) 

k=-x 

A  good  choice  of  the  dilation  factor  is  S  =  2,  in  which  case  the  equation  becomes 

k- oo 

®0)=  0(2  x-k)  (2) 

k=- co 

The  constant  coefficients  a*  are  called  filter  coefficients  and  it  is  often  the  case  that  only  a  finite  number  of  these  are 
non-zero.  The  filter  coefficients  are  derived  by  imposing  conditions  on  the  scaling  function,  0(x) .  One  condition  is 

that  the  scaling  function  and  its  translates  should  form  an  orthonormal  set;  that  is, 
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(3) 


|  <J)(a:)<I>(x  +  T)dx  =  §  0  ^  t  eZ 

—30 

.  B  ri,  i=o 

where  8  0  j  =  J 

[0,  otherwise 

A  wavelet,  ¥  (x),  is  orthogonal  to  the  scaling  function  and  is  defined  by 

£=co 

*(x)«  (4) 

£=-C0 

where  N  is  an  even  integer.  This  definition  satisfies  orthogonality  since 

k= oo  £=a> 

<<&(*),¥(*)>  =  L  ®(2*-*)  £(-»  ^  aN.,.  £  <D(2x-/) dx 

A:=-co  £=-oo 

=  0. 

The  set  of  coefficients  {ak}  and  {(-1)kaN1  ,K }  are  said  to  form  a  pair  of  quadrature  mirror  filters. 

Figure  3  depicts  the  Haar  scaling  function  and  Haar  wavelet,  Daubechies  4  scaling  function  and  Daubechies  4 
wavelet,  Daubechies  6  scaling  function  and  Daubechies  6  wavelet,  and  the  Coiflet  2  scaling  function  and  Coiflet  2 
wavelet  respectively.  These  were  obtained  by  using  different  sets  of  filter  coefficients  to  solve  equations  3  and  4 
above. 

Now  for  the  derivation  of  the  coifficients.  In  order  for  the  scaling  function  to  provide  a  useful  basis  for 
functional  analysis,  it  should  satisfy  useful  properties  which  lead  to  corresponding  conditions  on  the  filter  coefficients. 
These  properties  are: 

(I)  To  uniquely  define  all  scaling  functions  of  a  given  shape,  the  area  under  the  scaling  function  is  normalized  to  unity; 
that  is, 


oo 

j<D(x>fe  =  1. 

-00 


(4’) 


Also,  the  fact  that  the  scaling  function  has  a  non-vanishing  integral  forces  the  sum  of  the  filter  coefficients  to  be  equal 

£=oo 

to  2;  that  is,  a  k  =  0.  (If  this  is  not  obvious,  employ  the  “trick”  of  multiplying  the  scaling  function  <J)(x)  by  2  and 

£=- oo 

integrating  by  2  to  obtain 
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03  k= co 


2  J 0(x)cfcc  =  2  J  ^ak  0(2 x-k) 

—oo  —co  k—~<c 

k- co  ® 

=  £ak  J <E>(2jc - *)^(2jc - *) 

*=-«  -» 

£= oo 

£«.  ■  2- 


(5) 


k=-oo 

k=co 


The  uniqueness  of  OO)  is  guaranteed  by  ]Tak  =  2.  However,  a  smooth  solution  does  not  necessarily  follow. 


&=-C0 


As  an  example,  let  c0  =  2  with  O(jt)  =  8  (x).  Then  8  (x)  =  28  (2x),  and  yet  the  delta  function  is  not  smooth. 

For  a  scaling  function  to  be  orthogonal  to  its  integer  translates,  the  filter  coefficients  must  satisfy  the  additional 


CO 

requirement  that  J  0(x)0(jc  +  i)dx  =  8  0  a  t 


el 


k- co 

This  forces  the  condition  u=  28  0  ^  t  el 


(6) 


Jfc=-oo 


To  determine  a  unique  set  of  filter  coefficients,  one  more  condition  other  than  (5)  and  (6)  needs  to  be  satisfied.  In  an  N 
coefficient  system,  the  two  equations  yield  a  total  of  N/2  +1  equations.  For  a  unique  solution,  another  N/2  - 1 
equations  are  necessary.  To  obtain  the  remaining  equations,  it  was  discovered  by  Daubechies  [  4]  that  we  can 
require  the  scaling  function  to  be  able  to  exactly  represent  polymonials  of  order  up  to,  but  not  greater  than  p. 
Enforcement  of  this  requirement  results  in  the  compactly  supported  wavelets  developed  by  Daubechies.  This 
requirement  for  approximation  of  order  p  is  that  any  function  of  the  form 


f(x)  =  a„  +  a,x  +  a/  +  a/  +  •  •  +  a, 


Vi 


k- co 


can  be  exactly  represented  by  an  expansion  of  the  form  f(x)  =  ^aK<t>(x-k) . 


(7) 

(8) 


k=-<c 


Another  way  of  saying  this  is  that  the  polynomials  1  ,x . x1’'1  are  linear  combinations  of  the  translates, 

OO  -  k) .  This  now  may  be  translated  as  a  condition  on  the  wavelet.  Taking  the  inner  product  of  the  equation  in  (8) 

with  v  (x)  yields 

{f(x),'¥(x))=  a  k  (OO  -  k),  TO))  -  0. 

k=- CD 

Using  this  result  with  (7)  and  (8),  we  arrive  at 

co  00  =0 

a,  J  '¥(x)dx  +  a,  |  W(x)xdx  +  a2+  . .  +  J  TO)  x"1  dx. 

—oo  -co 

Since  this  identity  is  valid  for  all  a(  ( j  =  0,1,2,...,  p-1),  choosing  a=  1  and  letting  all  other  a,  =  0 
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(9) 


*  c 

yields  [  'F(jc)  x  dx  =0,  ^  =  0,1,2 . p-1 

-oo 

So,  the  first  p  moments  of  the  wavelet  must  be  zero..  To  determine  the  constraints  this  places  on  the  filter  coefficients, 
substitute  equation  (4)  into  (9).  From  this,  it  can  be  shown  that 

5  (-1)kakk^*0,/=0>112. . p-1  (10) 

k=-co 

It  was  shown  earlier  that  equation  (1 0)  must  yield  n/2  - 1  new  equations.  Now,  the  equation  obtained  by  letting  I  =  0  is 
redundant  since  it  may  be  obtained  from 

-y/2 [equaion(6)]i .  o  +  4 [equation(6)]i  >  o  -  [equation(5jf  (11) 

Equation  (11)  yields  p-1  new  equations  which  implies  from  (9)  that  p  =  N/2.  So,  the  final  conditions  on  the  filter 
coefficients  is 

£  (-1)kakk  ^  =0,  t  -0,1, 2,..., N/2  -1  (12) 

£=-co 

Thus,  the  filter  coefficients  (aKk  =  1 ,...,  N  - 1}  for  an  N  coefficient  system  are  uniquely  defined  by  (5),  (6),  and  (12). 

The  table  below  gives  the  coefficients  for  the  Daubechies  D$,  D6,  and  D8  wavelet  systems.  The  final  problem  that  has 
to  be  constructed  is  the  actual  scaling  function  ®(x)  from  the  fillter  coefficients. 


Table  I.  Filters  coefficients  for  Daubechies  DN  filter  where  N  =  4,  6,  and  8  (rounded  to  nearest  ten 
thousandth). 


k 

D4 

D6 

D8 

0 

0.6830 

0.4704 

0.3258 

1 

1.11830 

1.4144 

1.0109 

2 

0.3170 

0.6504 

0.8922 

3 

-0.1830 

-0.1909 

-0.0396 

4 

-0.0450 

0.2645 

5 

0.0498 

0.0436 

6 

0.0465 

7 

-0.0150 
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How  are  the  scaling  functions  constructed? 


Generally,  scaling  functions  do  not  have  a  closed  form  solution;  rather,  they  have  to  be  generated  recursively 
from  the  dilation  equation  (2).  This  equation  may  be  explicitly  written  as 

0>(x)  =  ao<&(2x)  +  ai<D(2jt  -  l)+...+a*  -  iG>(2x  -N+l) 

Writing  this  relationship  for  all  integer  values  x  =  j,  the  following  set  of  equations  can  be  generated. 

<X>(0)  =  ao<D(0) 

0(1)  =  aoO(2)  +  a, 0(1)  +  a:O(0) 

0(2)  =  aoO(4)  +  aiO(3)  +  ^0(2)  +  aiO(l)  +  ^0(0) 


0(A?-2)  =  an-  30 (N - 1)  +  as  -  2<£(iV  -  2)  +  as  -  .O (N -  3) 

0(iV"  -\)  =  as-  0(iV  - 1) 

The  matrix  form  of  the  above  can  be  generated  and  may  be  written  as  M  O  =  O . 

So,  the  vector  of  integer  values  of  the  scaling  function,  O ,  is  the  eigenvector  of  M  corresponding  to  the  eigenvalue  1 . 
As  in  all  eigenvalue  problems,  the  solution  of  the  equation 

(M -1)0  =  0.  (13) 

is  not  unique,  and  therefore  a  normalizing  condition  is  needed  in  order  to  determine  a  unique  eigenvector.  In  this  case 
the  normalizing  condition  arises  from  equation  4’.  After  some  calculation,  we  obtain 

£  o(i)  =  1, ie z  (14) 

k=-CQ 

Therefore,  the  values  of  the  scaling  function  at  the  integers  are  given  bhy  the  solution  to  equation  (13)  and  normalized 
by  equation  (1 4).  So,  we  have  how  to  find  the  integer  values  of  ‘x’;  the  values  of 
<J>(x)  of  the  half  integers  can  be  found  by  equation  (2).  That  is, 

0>(x/2)=  ]Tak<!>(x-fc),  (15) 

k--co 

So,  this  process  is  repeated  as  many  times  as  necessary  to  find  the  values  of  ®(x)  at  all  dydadic  points 
{i/2n  ,neZ) 

How  to  Calculate  the  Wavelet  Coefficients  for  a  particular  Example. 

An  example:  The  Daubechies  4  coefficient  wavelet  system 
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We  will  demonstrate  how  to  use  the  previous  results  to  construct  the  D4  (Daubechies  4  coefficient)  scaling 
function  and  wavelet. 

(1)  First  construct  the  filter  coefficients. 

So,  we  want  to  solve  equations  (5),  (6),  and  (12).  The  easiest  way  todo  tihis  is  to  include  the  redundant 
equation  for  l  =  0  in  equation  (6),  which  we  know  is  linear,  and  to  exclude  the  non-linear  equation  for  i  =  lin  equation 
(12).  Therefore,  we  have 

3o  +a1+a2  +aj  =  2 
a02+a,2+a22a33=2 
a0-a1+a2-a3=O 
-a0  +a,+2a2  -3aj  =  0 

Now,  from  the  above  linear  equations,  we  have: 
a^a,.  -1/2,  a2=3/2-a,,a3=1  -a, 

Substituting  for  a  0 ,  ^  ,  and  a,  in  the  quadratic  equation  and  solving  gives 
3  ±  73 

a.  = - ,  and  from  this  we  find  that 

4 

1±V3  3  +  V3  1  +  73 


It  is  obvious  that  one  set  of  solutions  is  the  antithesis  of  the  other.  That  is,  if  one  leads  to  the  scaling  function  O  (x), 
then  the  other  will  lead  to  O  (-x).  Yet,  both  equations  satisfy  the  condition  of  normality,  orthonormality  and  exact 
representation  of  a  linear  function.  The  solution  that  will  be  adopted  is  the  one  with  the  upper  of  “plus-minus  or  minus- 
plus”  in  the  above  notation. 

The  values  of  the  scaling  function  at  the  integer  points  are  given  by  equation  (1 3): 


Go  - 1  0  0  0  0(0)  0 

Cli  Go  - 1  Go  0  0(1)  0 

0  <23  Go  - 1  Gi  0(2)  0 

0  0  0  Go  -  J  |_0(3)J  [o. 


From  this,  we  find  that  0(0)=  0,  0(3)  =  0,  0(2)  = 


1-73 

1  +  73 


0(1) 
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Substituting  this  into  equation  (14)  yields 


4*1)  =  and  4*2)=  '-f- 


Equation  (15)  then  gives  us  the  values  of  O(x)  at  the  half-integers. 

=  3,4*1)=^ 


o^J=a3<D(2)+a2o(i)=o 

2-S 

Now,  the  Mallat  [6]  transform  decomposition  algorithm  or  another  can  be  used  to  operate  on  the  coefficients 
for  data  compression. 
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1  Introduction 


The  research  described  in  this  report  was  conducted  with  the  objective  of  studying  and  suggesting 
ways  to  improve  the  performance  of  Air-to-Ground  Target  Recognition  systems  that  use  High 
Resolution  Radar  (HRR)  data.  Specifically,  we  worked  with  the  HRR  data  for  five  target  vehicles. 

During  the  first  phase  of  research  we  performed  a  number  of  investigations  with  the  objective 
of  learning  about  the  nature  and  characteristics  of  the  HRR  data.  Our  perspective  during  these 
studies  was  to  determine  the  suitability  of  various  types  of  features  for  the  classification  of  target 
vehicles. 

The  insights  provided  by  these  studies  led  us  to  hypothesize  that  ATR  performance  can  po¬ 
tentially  be  improved  if  instead  of  uniform-width  templates,  sets  of  variable-width  templates  are 
constructed  from  data.  We  developed  a  heuristic  search  based  algorithm  for  constructing  variable- 
width  templates  and  demonstrated  the  potential  improvement. 

In  the  following  sections  we  first  list  the  results  of  the  studies  performed  with  the  data  and 
then  outline  our  algorithm  and  results  for  constructing  variable-width  templates.  We  also  describe 
a  hierarchical  attributes  based  recognition  system  for  which  only  some  preliminary  work  was 
initiated  during  this  study. 

2  Nature  of  Data 

Our  studies  were  performed  using  synthetic  data  generated  by  the  XPATCH  system.  A  brief 
description  of  the  data  is  as  follows.  A  transmitted  signal  containing  100  different  frequencies  is 
beamed  at  the  target  vehicle  and  the  reflected  energy  is  collected.  This  reflected  energy  depends 
on  the  shape  of  the  reflection  characteristics  of  the  area  from  which  it  is  reflected.  A  plot  of  the 
reflected  energy  for  the  100  pulses,  the  ( signature ),  contains  information  about  the  characteristics 
of  the  reflecting  area.  The  data  consists  of  signatures  obtained  from  every  0.04  degree  of  each 
target  vehicle.  This  gives  us  a  100  x  9000  matrix  of  complex  numbers  for  each  target  vehicle  and 
for  a  fixed  elevation  angle.  We  worked  with  the  data  for  5  target  vehicles.  These  vehicles  were: 
Ml  tank,  T-72  tank,  School  Bus,  Fire  Truck,  and  Scud  Missile. 

3  General  Approach 

The  three  phases  of  the  ATR  process  are:  (i)  Pre-processing  of  data;  (ii)  Data  Reduction  and 
Representation;  and  (iii)  Pattern  Recognition.  In  the  following,  we  discuss  the  prevalent  methods, 
details  of  studies  performed  by  us  and  the  results  of  modifications  implemented  by  us. 

3.1  Pre-Processing  of  Data 

Most  prevalent  methods  transform  the  signatures  from  frequency  domain  into  time  domain  using 
Fourier  Transform  methods  and  work  with  the  resulting  Range-Profile  data.  In  our  studies  we 
used  the  magnitude  part  of  such  Range-Profile  data.  We  also  performed  a  power  transform  of  the 
data  before  using  it  so  as  to  restore  the  normal  distribution  of  the  values  that  gets  altered  during 
the  process  of  computing  magnitude  of  complex  numbers.  A  plot  of  one  range  profile  is  shown  in 
Figure-1.  Figure-2  shows  the  intensity  plot  for  the  9000  range  profiles  of  one  target. 
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Figure  1 :  A; 


3.2  Data  Reduction  and  Representation 

The  data  that  needs  to  be  stored  for  each  target  consists  of  one  array  of  100  x  9000  complex 
numbers  for  each  target,  and  for  each  angle  of  elevation.  This  is  an  enormous  amount  of  data. 
There  is  a  large  amount  of  redundancy  in  this  data  which  arises  due  to  the  fact  that  signature  of 
a  vehicle  varies  only  gradually  as  we  change  the  rotational  and /or  the  elevation  angle. 

In  current  ATR  methods,  this  redundancy  is  exploited  by  constructing  templates  of  fixed-sized 
angular  regions  around  a  vehicle.  Typically.  180  templates  of  2-degree  angular  width  each  may  be 
constructed  for  a  vehicle.  Within  each  such  template  fifty  contiguous  range-profiles  are  averaged 
to  obtain  the  representative  template  signature.  During  the  recognition  stage,  a  signature  is 
matched  to  these  templates  to  find  the  closest  template.  Figure-3  shows  the  intensity  plot  of 
range  profiles  for  T-72  (for  90-180  degrees  region)  and  also  a  similar  plot  for  3-degree  templates 
in  this  region.  Representation  of  9000  range  profiles  for  a  target  by  120  or  180  template  signals 
is  significant  reduction  in  the  amount  of  data  to  be  stored. 

3.3  Recognition 

The  Prevalent  Method  is  to  determine  the  distance  between  the  sample  (to  be  classified)  range- 
profile  and  each  of  the  template  signals  and  find  the  closest  template.  Sample  is  then  classified 
as  belonging  to  the  target  of  the  closest  template. 

4  Preliminary  Investigations 

We  performed  a  number  of  investigations  for  determining  the  characteristics  of  the  HRR  data  for 
the  targets.  Most  of  these  investigations  employed  measurement  of  various  information-theoretic 
quantities  for  the  data  sets.  A  brief  description  of  the  investigations  and  their  results  is  mentioned 
below. 

•  Task  1:  We  computed  informational  entropy  for  each  range  profile  for  the  HRR  data  for 
each  target.  The  calculation  for  the  entropy  was  performed  as  follows.  Take  range  profiles 
Pm, pm  +  1,  and  pm+ 2  for  a  target.  Each  of  these  profiles  is  of  the  type  shown  in  Figure-1 
and  consists  of  100  real  values.  Determine  the  maximum  Max  and  the  minimum  Min 
of  these  three  hundred  values.  Partition  the  Max-to-Min  range  in  25  equal  intervals  and 
determine  the  number  of  data  points  (out  of  300)  lying  within  each  interval.  Compute  the 
entropy 

25 

Em  =  S2  -Pilog{Pi)  (1) 

i=  1 

where  Pi  is  the  probability  of  a  data  point  belonging  to  the  ith  of  the  25  intervals.  This 
value  is  shown  as  the  entropy  for  the  mth  point  and  the  step  is  repeated  for  all  the  9000 
profiles  of  a  target  data.  An  example  of  such  a  plot  for  target  M-l  is  shown  in  Figure-4. 

-  These  investigations  showed  areas  of  each  target  that  are  rich/deficient  in  informational 
entropy.  Interestingly,  each  target’s  data  showed  very  low  entropy  value  around  0,  90, 
180,  and  270  degree  regions.  Intuitively,  it  is  easier  to  distinguish  targets  in  regions 
where  they  possess  high  informational  entropy. 
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T-72  HRR  Image  (90-180  degrees) 
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Figure  3:  Templates  for  T-72  Target  (90- 180  degrees) 
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Figure  5:  Mutual  Information  between  Pairs  of  Targets 
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•  Task  2:  We  compared  the  informational  entropy  of  range  profiles,  the  same  quantity  as 
described  above,  for  all  targets  before  and  after  applying  the  power  transform.  Figure-4 
shows  the  plots  for  the  entropy  values  in  the  two  cases. 

-  It  is  evident  from  these  plots  that  throughout  the  target’s  360  degrees  the  entropy  value 
increases  by  an  almost  constant  amount  as  a  result  of  applying  the  power  transform. 
However,  the  relative  amount  of  information  in  various  regions  of  a  target  remains  the 
same. 

•  Task  3:  Computed  Mutual  information  for  all  pairs  of  targets.  This  study  compared 
regions  of  targets  and  showed  which  regions  were  similar/different  to  each  other  from  the 
perspective  of  informational  entropy.  A  sample  output  is  shown  in  Figure-5. 

5  Seeking  Good  Features 

Based  on  the  insight  provided  by  the  above  studies  we  designed  a  set  of  hierarchical  features  to  be 
extracted  for  the  HRR  Data  set  of  each  target.  An  overview  of  these  features  can  be  summarized 
by  the  three- level  binary  tree  shown  in  the  following  diagram. 


One  such  binary  tree  corresponds  to  one  range-profile  signal  similar  to  the  one  shown  in  Figure- 
1.  Each  such  signal  is  100-points  wide,  we  examine  all  10-point  wide  regions  and  determine  the 
region  with  maximum  amount  “Energy”  in  it.  An  estimated  measure  of  energy  is  obtained  by  the 
product  of  informational  entropy  in  the  10-point  wide  region  and  the  average  value  of  the  signal 
in  the  region.  The  center  cf  this  10  point  wide  region  is  called  the  location  of  the  corresponding 
energy  amount. 

For  a  signal  we  determine  the  location  of  the  maximum  energy  and  assign  it  to  the  node 
number  1  of  the  binary  tree.  The  signal  is  then  partitioned  at  this  point  into  two  parts.  The  Left 
Part  contains  the  signal  that  lies  to  the  left  of  the  10-point  region  of  maximum  energy  and  the 
Right  Part  contains  the  signal  that  lies  to  the  right  of  the  maximum  energy  region.  Determination 
of  regions  of  maximum  energy  and  partitioning  is  recursively  continued  until  the  seven  nodes  of 
the  above  tree  have  been  filled.  It  is  possible  that  many  nodes  of  this  tree  may  remain  unfilled 
due  to  the  nature  of  a  particular  signal. 


This  system  was  fine-tuned  and  binary  trees  determined  for  the  180  2-degree  templates  of 
each  target.  That  is,  data  for  each  target  was  reduced  to  180  template  signals  by  summing  over 
2-degree  regions  and  then  a  binary  tree  was  generated  for  each  resulting  template  signal.  Figure-6 
shows  a  plot  of  these  binary  trees  for  a  target.  This  plot  only  shows  the  locations  where  regions 
of  high  energy  are  found  in  the  consecutive  template  signals.  Solid  line  shows  the  locus  of  the 
location  stored  at  node  number  1  of  the  tree.  Loci  for  the  locations  stored  at  node  numbers  2 
and  3  of  the  tree  are  shown  by  dashed  line;  and  those  for  node  numbers  4  and  7  by  the  dotted 
lines.  The  plot  shown  in  Figure-6  presents  a  very  good  presentation  of  the  main  scatter  centers 
for  the  target.  This  is  a  validation  of  the  fact  that  the  features  capture  good  physical  properties 
of  the  target  from  the  HRR  data.  There  is  much  noise,  however,  in  the  plot  in  Figure-6  and  the 
definitions  of  features  need  to  be  further  refined  before  a  recognition  system  is  finally  constructed. 

The  binary  tree  representation  stores  at  its  nodes  information  about  the  locations  and  amounts 
of  energy  at  the  various  energy-centers  of  a  signal.  This  information  can  be  used  to  synthesize 
new  attributes  at  each  node.  For  example,  a  moment-of-energy  can  be  computed  at  each  node 
based  upon  the  moments- of -energy  at  the  children  nodes.  A  number  of  encouraging  results  were 
obtained  in  this  direction  but  they  also  pointed  towards  an  anomaly  having  its  origins  in  the 
data-reduction  stage. 

5.1  A  Problem  with  Template-Making 

Each  range-profile  is  a  signal  consisting  of  100  values.  In  our  HRR  dataset  a  signal  is  obtained 
after  every  0.04  degree  movement  around  a  target,  and  from  a  fixed  elevation.  This  results  in  9000 
range-profiles  being  collected  for  a  target,  for  an  elevation.  Templates  axe  made  by  Averaging 
contiguous  sets  of  50  profiles  each,  assuming  that  the  shapes  of  signals  change  only  gradually. 

The  assumption  that  the  shapes  of  signals  change  only  gradually  is  largely  true  but  there  are 
numerous  points  where  the  transitions  in  signals’  characteristics  are  sharper  than  the  usual.  When 
the  sharper  transitions  are  included  within  a  templates  and  the  signals  averaged,  the  resulting 
templates  have  larger  variation  from  the  individual  signals  contained  in  the  template.  More 
formally,  let  us  define  ith  range-profile  as  Pi{k)  where  k  takes  the  values  from  1  to  100.  The  mth 
template  tm  is  formed  by  averaging  50  contiguous  range  profiles  as  follows: 

m*50 

tm{k)  =  51  for  k  =  1....100.  (2) 

l=(m— 1)*50+1 

The  ar?r~g?J  template  signals  differs  from  each  of  the  50  individual  signals  of  which  it  is  an 
average.  One  way  to  define  this  error  for  the  i.  range  profile  is  by  determining  the  Square-Error 
as  follows: 

Ei{k)  =  ( abs(Pi{k )  -  tm{k )))2  for  k  =  1....100.  (3) 

The  average  of  the  Square-Error  vector  for  a  template  can  be  computed  as  follows: 

m*50 

MSEm(k)  =  53  £/(fc)  for  k  =  1....100.  (4) 

i=(m— 1)*50+1 

Figure-7  shows  a  template  signal  and  its  corresponding  MSE  signal.  A  measure  of  how  much,  on 
the  average,  a  template  varies  from  its  constituent  signals  can  be  obtained  by  computing  the  area 
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under  the  MSE  curve  shown  in  Figure-7.  This  area  can  be  computed  as: 


100 

ERRORm  =  £MS£m(Z). 

i=  1 


(5) 


5.2  Uniform-Width  Templates 

The  prevalent  method  for  making  templates  is  to  partition  the  entire  angular  region  around  a 
target  into  equal-sized  angular  regions  and  then  average  the  range-profiles  falling  within  the  region 
of  each  template.  For  our  dataset,  if  we  want  to  make  2-degree  templates  then  contiguous  sets 
of  50  profiles  would  be  taken  to  form  template  signals.  The  starting  points  of  these  templates 
would  be  at  profile  numbers:  1,  51,  101,  151  ...8951.  The  average  error  resulting  from  forming 
180  uniform-width  templates  in  this  way  can  be  determined  as: 

180 

Avg  -  MSE  =  l/180(]jr  ERRORm-  (6) 


5.3  Variable-Width  Templates 

Our  insights  into  the  HRR  data  for  target  vehicles  suggested  that  the  ERROR  resulting  from 
averaging  within  a  fixed-width  template  is  high  because  signatures  of  significantly  different  char¬ 
acteristics  get  averaged  together.  That  is,  the  angular  regions  around  a  target  where  the  transition 
in  signal  shapes  is  sharper  than  the  average  gradual  rate  of  change,  the  resulting  template  signals 
have  higher  value  for  ERROR.  The  ERROR-value  can  be  decreased  if  the  boundaries  of  the  180 
templates  are  moved  around  and  readjusted  in  such  a  way  that  the  points  of  sharper  transitions 
form  the  boundaries  between  templates.  The  problem  now  is  to  determine  the  placements  for  the 
boundaries  such  that  the  ERROR  is  minimized. 

5.4  Heuristic  Search  for  Template  Boundaries 

The  problem  of  finding  the  template  boundaries  so  as  to  minimize  the  value  of  ERROR  is  an 
optimization  problem  and  can  possibly  be  formulated  in  a  number  of  different  ways.  After  our 
initial  attempts  at  formulating  it  in  terms  of  some  relaxation  algorithms  did  not  show  much 
promise,  we  formulated  it  as  a  heuristic  search  problem.  Main  steps  of  this  algorithm  are  as 
follows: 

■  1.  Take  a  Window  of  5-degree  width  and  slide  it.  by  changing  one  range  profile  at  a  time,  across 
the  entire  dataset  for  the  target.  Determine  the  value  of  MSE  for  each  5-degree  template  as 
the  window  slides  across  the  target.  Figure-8  shows  a  plot  of  the  MSE  value  for  one  target. 

2.  Rearrange  the  points  according  to  increasing  values  of  their  corresponding  MSE  values. 

3.  Select  a  subset  of  50  template-starting  points  from  the  above  list  such  that: 

(a)  Every  range  profile  gets  included  in  one  or  the  other  template.  (By  a  template  we  refer 
to  all  those  contiguous  range  profiles  that  will  be  averaged  to  obtain  a  representative 
template  signal.) 
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(b)  No  two  templates  overlap. 

(c)  Starting  points  are  chosen  as  much  as  possible  towards  the  beginning  of  the  list  created 
in  step-2. 

A  set  of  points  generated  as  above  constitutes  one  hypothesis  about  the  template  starting 
points.  It  is  possible  to  generate  a  number  of  such  hypotheses.  After  generating  these 
hypotheses  we  have  to  find  the  best  solution  by  suitable  refining  the  most  promising  hy¬ 
potheses  by  operators  that  change  the  staring  points  of  templates.  This  is  achieved  by  the 
following  step  that  performs  a  heuristic  search. 

4.  Run  a  heuristic  algorithm  (modeled  after  A*  search  algorithm)  with  the  following  search 
operators: 

(a)  Expand  a  template  to  include  more  range  profiles. 

(b)  Partition  a  template  into  two  templates  in  such  a  way  that  their  weighted  MSE  is 
minimized.  The  weightings  arc  provided  by  the  width  of  each  partition. 

(c)  Merge  two  adjoining  templates  into  one. 

At  each  step  of  this  algorithm  a  heuristic  function  is  used  to  determine  the  hypothesis  that 
should  be  selected  for  modification  by  one  of  the  above  steps. 

5.  The  algorithm  terminates  when  the  number  of  templates  has  reached  180. 

For  the  case  of  uniform- width  templates  the  value  AVg-MSE  as  defined  above  gives  an  in¬ 
dication  of  “Goodness”  of  the  templates.  The  same  function  can  be  employed  to  determine  the 
similar  “Goodness”  for  the  variable-width  templates  generated  by  our  search  algorithm.  However, 
there  is  one  difference  that  we  must  account  for.  When  the  templates  axe  of  different  widths  their 
contribution  to  the  average  error  for  all  the  templates  should  be  in  proportion  to  their  individual 
widths.  That  is, 

180 

Avg  -  MSE  =  (1/9000)  ( £  {ERRORrr.  *  Widthm)  where  (7) 

71=1 

Widthm  is  the  width  of  the  mth  template.  The  heuristic  functions  chosen  by  us  for  the  above 
described  search  algorithm  were  designed  to  minimize  this  error  function.  It  is  also  possible  to 
alter  these  heuristic  functions  and  minimize  some  other  measure  of  error,  such  as, 

Avg  -  MSE  =  (Maxl^J.ERRORm  *  Widths).  (8) 

5.5  Results  of  Search  Algorithm 

We  plotted  the  values  of  Avg-MSE  for  an  equal  number  of  uniform-width  and  variable-width 
templates  constructed  from  the  same  regions  of  all  targets.  The  plot  for  the  fire-truck  target  is 
shown  in  Figure-9.  It  can  be  seen  from  this  plot  that  for  the  case  of  180  templates  the  average 
value  of  error  for  variable- width  templates  is  about  12%  less  than  the  error  for  the  same  number 
of  uniform-width  templates.  Or  alternatively,  about  30%  fewer  templates  of  variable-width  need 
to  be  constructed  for  maintaining  the  same  error  rate.  Figure-10  shows  the  distribution  of  error- 
values  for  individual  templates  for  the  two  types  of  template  construction  procedures. 
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MSE  (area) 


1.8 


FTR:  Comparison  of  MSE  for  equal  and  variable  sized  templates  (0-360  degrees) 


Figure-9:  Moan  Square  Error  for  Equal  Number  of  Uniform  and  Variable  Width  Templates 
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Distribution  of  USE-vams  for  120  Uniform  amplatss 


MSETmplalt-Sizs 


FTR  Distribution  of  USE  vatu*  for  120  nriabnl*  sizsd  IsmpIsIM  (0-120  d*gr««) 


Figure  10:  Distribution  of  Error  among  Uniform  and  Variable  Width  Templates 
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This  result  is  significant  in  that  is  demonstrates  that  by  adjusting  the  boundaries  of  templates 
it  is  possible  to  significantly  reduce  the  Mean-Square  Error  within  each  template.  This  should 
result  in  enhanced  performance  for  those  implementations  of  ATR  algorithms  that  use  variable- 
width  templates  instead  of  uniform-width  templates. 

The  comparison  between  uniform-width  and  variable- width  templates  was  performed  for  each 
of  the  five  targets.  It  is  evident  from  these  tests  that  the  weighted  average  of  the  MSEs  can  be 
reduced  by  approximately  10-15  percent,  or  alternatively,  the  same  MSE  level  can  be  maintained 
by  constructing  30-40  percent  fewer  variable-width  templates. 

The  above  results  have  been  possible  with  the  help  of  a  very  crude  heuristic  implemented  only 
for  demonstrating  the  correctness  of  the  approach.  The  intuitive  explanation  for  the  improvement 
is  that  the  structural  features  of  the  target  vehicles  are  aligned  at  the  boundaries  of  variable- width 
templates.  It  is  possible  to  improve  these  heuristics  and  obtain  better  reduction  in  the  MSE  values 
by  constructing  variable- width  templates. 

6  Conclusion 

The  main  contribution  of  this  research  has  been  the  demonstration  of  an  algorithm  for  efficiently- 
constructing  variable-width  templates.  These  variable-width  templates  result  in  template  signals 
for  ATR  that  have  significantly  smaller  data-compression  error.  This  can  help  increase  the  per¬ 
formance  of  ATR  systems.  The  scope  of  this  study  was  limited  and  the  actual  ATR  tests  could 
not  be  carried  out  to  ascertain  the  extent  to  which  the  performance  of  ATR  can  be  improved. 

Other  contributions  of  this  study  are  the  insights  that  have  been  obtained  into  the  character¬ 
istics  of  the  HRR  data  for  ground  vehicles. 
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Alley  C.  Butler 

Robot  Path  Planning  into  Cavities  defined  by  Splines 

ABSTRACT 

This  report  describes  the  development  of  a  robot  path  planning  system  in  which  computational  geometry  is 
used  to  generate  an  optimal  robot  path.  The  architecture  of  this  system  makes  extensive  use  of  B-splines.  The 
system  is  designed  to  support  non-destructive  testing  of  jet  engine  parts  which  contain  multiple  cavities.  These 
cavities  are  modeled  by  B-splines,  and  a  Voronoi  tree  is  developed  by  finding  the  intersection  points  for  offset 
curves  or  hodographs  from  the  B-splines.  In  a  manner  similar  to  the  Voronoi  diagram,  the  Voronoi  tree  is 
maximally  distant  from  adjacent  obstacles.  This  Voronoi  tree,  therefore,  represents  the  path  with  greatest 
clearance.  For  this  reason,  the  Voronoi  tree  is  used  to  guide  path  development  for  a  single  robot  manipulator 
which  is  inserted  into  the  cavities.  The  addition  of  potential  field  methods  allow  development  of  complete  path 
information  for  the  robot  manipulator,  based  on  attaching  the  potential  field  to  the  robot  arm.  In  this  report,  a 
literature  survey  is  provided,  and  the  techniques  for  finding  a  Voronoi  tree  are  described.  Next,  the  adaptation  of 
these  techniques  to  develop  manipulator  paths  using  a  potential  field  is  discussed.  Further  evolution  of  the 
methodology  is  outlined,  and  conclusions  with  recommendations  are  provided. 
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INTRODUCTION 


Reliability  of  high  performance  jet  aircraft  engines  is  important  to  the  combat  survival  of  the  pilot  and  the 
aircraft.  This  reliability  is  ensured  by  periodic  non-destructive  inspection  of  the  jet  engines.  For  the  Pratt  and 
Whitney  F100-PW-229  engine,  the  inspection  is  done  on  a  partly  automated  basis.  Further  automation  of  non¬ 
destructive  inspection  has  the  potential  for  dramatically  reducing  engine  inspection  costs,  thereby  allowing  the  Air 
Force  to  improve  affordability.  Figure  1  shows  a  cut-away  view  of  the  F100-PW-229  jet  engine  (Pratt  &  Whitney, 
1996). 


Figure  1  -  Cutaway  View  of  F100-PW-229  Engine  (Pratt  &  Whitney,  1996) 


Present  methods  involve  eddy  current  inspection  of  engine  parts  using  a  Cartesian  robot  system.  The  path 
planning  process  for  these  Cartesian  systems  takes  approximately  25  hours  of  work  for  manual  path  planning 
through  conventional  teach  pennant  technology.  Recent  advances  in  the  development  of  algorithms  for  automated 
robot  path  planning  have  reduced  this  tedious  and  manpower  intensive  process,  significantly  to  a  matter  of 
minutes.  However,  the  present  techniques  are  not  optimal  in  nature,  and  the  occurrence  of  undetected  collisions 
requires  additional  computational  steps. 

One  of  the  more  challenging  inspection  processes,  involves  inspection  of  engine  internals  which  form  a  set 
of  cavities,  presently  inspected  by  a  Cartesian  robot  with  a  probe  for  flaw  detection  using  eddy  currents.  These 
cavities  must  be  entered  by  a  robot  manipulator  without  collision  with  the  part  that  is  being  inspected.  This 
involves  developing  a  path  which  describes  the  motion  of  the  robotic  arm  as  it  moves  the  probe  into  position  for 
inspection.  The  beginning  point  is  called  the  home  position  by  those  involved  in  operating  the  robot,  but  the 
terminology  in  the  literature  typically  discusses  movement  from  a  start  position,  S,  to  a  goal  position,  G.  Figure  2 
shows  a  sectional  view  of  this  process  for  the  eddy  current  inspection  of  engine  internals. 
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The  sectional  view  represents  a  three  dimensional  engine  part  in  two  dimensions.  This  is  appropriate,  since 
the  engine  is  symmetric  about  its  central  axis.  For  the  purpose  of  path  planning,  two  dimensional  analysis  can  be 
undertaken  as  a  simplification  of  the  more  complex  three  dimensional  problem,  because  of  symmetry  about  the 
central  axis.  This  approach,  in  which  the  path  plan  is  developed  in  two  dimensions  and  subsequently  checked  m 
three  dimensions,  is  the  currently  used  method.  It  is  also  the  approach  employed  in  this  study. 


S  or  Home 


Figure  2  -  Sectional  View  for  Robot  Path  Planning 


The  presently  employed  path  planning  method  includes  a  number  of  steps  which  provide  a  path  without 
collision  from  S  to  G  (Chemaly,  1996).  The  planning  process  begins  with  the  development  of  a  straight  line  from 
S  to  G.  If  this  does  not  intersect  a  boundary,  then  no  adjustments  must  be  made.  However,  this  is  unlikely. 
Adjustments  are  made  for  those  parts  of  the  line  between  S  and  G  which  interfere  (intersect)  with  the  engine  part. 
The  adjustments  are  developed  by  replacing  the  portions  of  the  straight  line  path  which  interfere,  with  a  path 
which  follows  the  surface  of  the  engine  part.  This  is  distinctly  similar  to  the  method  used  by  Lumelsky’s  (1991) 
Bug-2  algorithm.  Once  collisions  are  eliminated  by  replacing  intersecting  paths  with  paths  following  the  part’s 
boundary,  a  limited  technique  for  improving  the  path  can  be  employed.  This  limited  technique  finds  shorter  paths 
by  finding  straight  line  motions  which  eliminate  segments  of  the  path.  The  result  from  this  sequence  of  steps  is  a 
path  which  is  similar  in  principle  to  the  visibility  methods  described  by  Lozano-Perez  and  Taylor  (1989),  and 
identical  to  the  “string  tightened  path”  discussed  by  DuPont  and  Derby  (1988). 


In  the  visibility  methods,  the  path  is  very  close  to  collision,  similar  to  a  string  which  is  stretched  tight 
against  all  obstacles.  This  is  done  to  shorten  the  distance  traveled.  Unfortunately,  this  similarity  to  visibility 
techniques  has  important  drawbacks  in  the  present  application.  Because  the  present  technique  shortens  the  path  m 
a  manner  identical  to  the  visibility  methods,  the  path  is  close  to  collision  for  some  portions  of  the  planned  path. 
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methods  are  also  employed  to  achieve  results,  including  simulated  annealing  (Carrike,  e,  al  1990- 
Venkataraman,  .989,  and  recursive  quadratic  programming  (Leu  *  Sing  ,9“,  2 

by  Piegl  and  Tiller  (1987)  PWi  nooiu  ..  .  v  imer  (1983),  and 

)■  egl  ( 1 99 1 )  has  provided  a  survey  of  NURBS  properties. 

.echniZ:":; -  —  -  -  -  for  the  potential  field 
development  of  an  array  of  techniques  F^Z  7  ^  ^  imerSeCti°nS  °f  sPlines  inc,ud“  the 

a..,  19891  Dokken,  1985).  Sederberg  and  Parry  ^7^  “  'in<’  <*““  « 

based  on  recursive  cohesion.  O^s  I”  ^  which  was 

implementation  of  splines  (Barnhill  et  al„  1987-  B^hni^-  ^"bT*  M  ^  dimenSi°nal 

&  t 

S,Udy  "  ~  —  —  —  —on  widt  offset  curves  or  hodo  “  '  "  ^ 

A  number  of  researchers  have  focused  on  offset  curves  or  hodographs  Klassnosatn  ,  a 
offset  curves  which  follows  the  Hermite  form  (Foley  e,  al.,  ,990,  A  ,  h  ‘  n’e,h°d  ** 

by  Pham  (1988")  Till?  a  u  ’  echmque  for  uniform  B-splines  was  studied 

uy  rnam  (1988).  Tiller  and  Hansen  (1984)  and  Coauillart  riQR7i  „  .  . 

Nurrc  nth  .  r  VaOquillart  (1987)  worked  on  the  development  of  offsets  for 

~r:r;:r^,:rtrr,zi:;99o:s-J 

offsets  for  sphnes  ,s  a  comp.es  process,  aud  they  further  add  that  the  offset  of  a  spline  is  no,  necel^n! 
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(Tiller  &  Hansen,  1984).  Despite  the  complexity,  detecting  intersections  with  offset  curves  is  a  key  process  in 
computational  geometry,  for  path  planning  using  splines. 

In  addition  to  path  planning  using  splines,  the  use  of  spline  architectures  is  useful  in  another  way.  The  path 
planning  process  outlined  here  develops  an  ordered  sequence  of  points  in  2D  for  the  robot  and  its  joints  to  traverse. 
This  ordered  sequence  must  be  translated  into  a  continuous  path  with  smooth  and  continuous  motion.  This  means 
that  the  C  ,  C  ,  and  C  properties  of  B-splines  can  be  used  to  avoid  abrupt  changes  in  velocity  and  acceleration. 
Development  of  splines  from  a  sequence  of  points  has  been  studied  by  Papamichael  and  Soares  (1987)  and  by 
others.  The  use  of  these  splines  for  the  control  of  robot  manipulators  has  also  been  demonstrated  (Patel  &  Lin, 
1988;  and  others).  From  these  this  prior  work,  it  is  apparent  that  development  of  robot  path  plans  using  splines  is 
easily  feasible. 


DEVELOPMENT  OF  A  SPLINE  BASED  MODEL 

Finding  the  Voronoi  Tree 

The  name  Voronoi  tree  has  been  chosen  to  describe  the  technique  outlined  here,  because  of  the  similarity  to 
the  Voronoi  diagram  which  is  maximally  distant  from  all  objects  in  a  two  dimensional  space  (Lozano-Perez  & 
Taylor,  1989).  Both  the  Voronoi  diagram  and  the  Voronoi  tree  described  here,  provide  paths  for  robot  motion 
which  avoid  collisions  to  the  greatest  extent  allowed  by  the  geometry,  and  this  should  permit  solution  of  the  path 
planning  problem  in  two  dimensions  without  difficulties  when  the  plan  is  checked  in  3D. 

The  cavities  found  in  the  jet  engine  part,  as  shown  in  Figure  2,  represent  parts  which  should  be  described  by 
splines  in  a  CAD  system.  The  Voronoi  tree  is  developed  directly  from  the  spline  geometry.  Therefore,  this  spline 
based  approach  is  both  natural  and  efficient  for  path  planning. 

Offset  curves  from  splines  can  be  found  for  the  cavity  geometry.  The  utility  of  these  curves  or  hodographs 
was  originally  recognized  by  Air  Force  personnel  (Dr.  LeClair)  with  subsequent  application  here.  These  offset 
curves  or  hodographs  follow  the  contour  of  the  spline,  but  they  exist  at  an  offset  distance,  d,  from  the  spline.  The 
equations  which  describe  the  offset  curves  are  parametric  in  t  in  a  manner  similar  to  the  spline  curves  (Hoschek, 
1985): 

x'(t)  =  x(t)  +  d(nx) 
y'(t)  =  y(t)  ±  d(ny) 

where  x\  y’  describe  the  offset  curve,  x,  y  describes  the  spline,  d  is  the  offset  distance,  and  n  represents  the 
normal  to  the  spline  curve.  Offset  curves  can  be  developed  for  different  values  of  d. 


(1) 

(2) 
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If  offset  curves  from  two  surfaces  described  by  splines  intersect,  rhe  pom,  of  intersection  is  a  distance  d  from 
•cb  spline  curve.  So,  the  points  of  intersection  for  offset  curves  (having  identical  vaiues  of  d,  are  equidistant  from 
7  sur  aces.  By  intersecting  muitiple  ofrse,  curves  with  different  vaines  of  d,  points  which  are 
from  both  surfaces  can  be  found.  By  connecting  these  points,  a  Voronoi  free  is  deveioped  in  which  points  on  me 

Hgure;i“,0adia“"tSUrfaCeS'  ™SiSa'-tec“~by,hiss,ud,a„di,isi,Iusfru,edi„ 


Adding  Robot  Linkage 

Although  the  Voronoi  free  represents  a  path  which  is  equally  distant  to  all  bound, ng  splines  i,  does  not 
prov.de  information  on  how  a  robofs  linkages  should  be  con, r„, led  to  enter  a  cavity,  Fo,  this  reason  additional 
ompu  ation  is  required  to  find  the  robofs  optima,  configuration  as  i,  traverses  the  Voronoi  free  from  S  to  G  To 
ajplish  ,h,s  portion  of  the  path  planning  problem,  ofrse,  curves  are  again  required.  These  offset  curves  are 

methl  H° Pr°7o™  °f  ,he  P0“ial  f°r  inMrSeC,i0"'  AS  *»  **■»*  is  simiiar  to  potential  field 

artifieia  ^7  ^  ^  ““  ^  Md  -  ^elopmen,  of  an 

.al  which  ts  used  to  surround  all  obstacles.  Although  this  artificial  potential  fieid  is  based  on  simpi, 

mg  terms  to  the  pad,  planning  algorithm,  these  added  terms  provide  a  means  of  "repelling-  the  robot  awa„ 
0Ptiml'  ^  -  -  —  *om  coumigl: 


Figure  3  -  Developing  Points  on  the  Voronoi  Tree  with  Intercepting  Offset  Curves 


In  the  technique  advocated 
fields  are  associated  with  obstacles. 


here,  there  is  one  difference  to  other  potential  field  methods.  Most  potential 
However,  if  that  approach  is  used  here  there  will  be  multiple  offset  curves,  and 
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the  offset  curves  will  require  significant  computational  time.  Additionally,  the  intersections  between  the  offset 
curves  and  the  polygonal  boundaries  of  the  robot  must  be  found.  This  is  a  relatively  complex  computational  task. 

To  limit  these  difficulties,  an  equivalent  approach  is  used.  The  potential  field  is  attached  to  the  robot  arm. 
This  allows  offsets  to  be  constructed  as  simple  lines  which  are  parallel  to  the  polygon  which  represents  the  robot’s 
arm.  These  lines  are  intersected  with  the  splines  which  form  the  cavity.  This  represents  a  very  efficient  way  to 
represent  and  compute  potential,  since  the  lines  all  intersect  with  a  spline  which  has  known  control  points.  (The 
control  points  of  the  offset  curve  would  have  to  be  determined  as  added  computational  burden.)  These  control 
points  form  a  convex  hull  around  portions  of  the  curve,  and  this  permits  checking  of  line  intersections  with  the 
convex  hull.  The  checking  of  intersection  with  the  convex  hull  is  a  local  property  of  the  spline  curve,  and  line- 
splme  intersections  can,  therefore,  be  rapidly  and  easily  found.  This  is  a  significant  innovation  directly  inspierd  by 
close  collaboration  with  an  Air  Force  research  enginer  (again,  Dr.  LeClair). 


As  presently  implemented  in  C  language  code,  the  intersection  of  a  line  and  spline  is  found  by  a  hierarchical 
method  which  rapidly  excludes  trivial  cases  in  the  computational  hierarchy.  This  process  of  descending  down  the 
hierarchy  is  developed  to  maximize  computational  speed.  It  involves  excluding  line  spline  intersection  first  by  a 
bounding  box.  Then,  exclusion  of  line-spline  intersection  is  accomplished  by  checking  for  intersection  with  the 
convex  hull  for  the  spline.  If  the  convex  hull  is  intersected,  then  Newton-Raphson  procedures  are  employed  to 
determine  the  point  of  intersection  (Mortonson,  1985;  Mortonson,  1990)  using  data  from  the  earlier  computations 

on  the  convex  hull.  This  provides  line-spline  intersection  data  rapidly  and  avoids  expensive  computation  for  non¬ 
intersecting  cases. 

Once  line-spline  intersections  are  determined,  the  level  of  potential  field  interference  with  the  cavity 
boundaries  as  obstacles  can  be  determined.  This  potential  field  is  based  on  the  length  of  the  offset  lines  which 
intersect  the  splines.  Equation  (3)  below  shows  how  the  potential  function  should  be  constructed.  The  first  term 
involves  the  development  of  a  measure  for  the  Euclidean  distance  between  the  manipulator  tip  at  xt,  y,  and  a  point 
on  the  Voronoi  tree  as  x*  y*.  This  term  is  multiplied  by  a  factor  K  which  is  adjusted  to  obtain  a  good  balance 
between  robot  arm  position  and  motion  of  the  manipulator  tip  relative  to  the  Voronoi  tree.  The  second  term  sums 
the  product  of  the  weight  associated  with  an  offset  curve  as  w,  and  the  length  of  intersection  as  L,  between  the 
offset  curve  and  the  spline  describing  the  cavity.  The  value  of  U  should  reflect  the  severity  of  interference  between 
the  obstacle  and  the  offset  line.  Of  course,  offset  lines  at  greater  distance  from  the  robot  arm  would  have  lesser 

weight  as  Wj,  since  the  probability  of  collision  is  less.  The  result  is  a  measure  as  <5  of  how  close  the  robot  arm  is  to 
a  collision. 

o(x,y)  =  K-J(x  —  x, )  +(y*-yt)  +  2  L;  Wj  (3) 
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With  this  measure  available,  an  optimal  position  for  the  robot  arm  can  be  found  to  avoid  collision.  This 
technique  is  illustrated  in  Figure  4  for  three  linkages  and  a  portion  of  the  part. 


Spline  for  Part  Boundary 


Robot  Arm 

Offset  Curve 
(Line) 


Figure  4  -  Developing  a  Potential  Field  Attached  to  the  Robot  Arm 

EVOLUTION  OF  THE  METHODOLOGY 

Time  allocated  to  the  research  has  been  finite,  and  choices  regarding  which  veins  of  research  to  pursue 
aggressively  had  to  be  made.  Despite  limitations  in  time  and  resources,  several  key  steps  have  been  completed 
which  represent  an  important  beginning.  The  steps  which  were  completed  include  an  orientation  to  the  problem, 
an  in  depth  literature  review  (which  is  only  partially  represented  in  this  report),  development  of  key  research 
concepts  and  approaches,  and  commencement  of  programming  for  a  prototype  system.  The  programming  was 
conducted  in  the  C  language  in  a  Unix  and  X  Windows  environment. 

Due  to  limited  time,  some  processes  are  deferred.  These  include  the  choice  of  offset  curve  architecture,  and 
the  optimization  of  hodograph-hodograph  intersection  algorithms.  Present  methods  require  human  computer 
interaction  which  necessitates  limited  user  involvement. 


Additionally,  programming  is  continuing  on  the  development  of  the  potential  field  technique.  Code  for 
representation  and  rendering  of  splines  as  non-uniform,  non-rational  b-splines  is  presently  in  place.  Intersection 
algorithms  for  line-spline  intersection  have  been  developed  using  methods  outlined  by  Prasad  (1991)  as  improved 
by  Shafer  and  Fuestel  (1992)  for  intersections  with  the  convex  hull.  These  methods  are  hierarchical  and  represent 
an  approach  designed  for  fast  computation.  The  development  of  other  functions  and  routines  for  the  path  planning 
system  are  continuing,  with  the  development  of  algorithms  to  determine  <D  in  equation  (3)  as  the  next  priority. 


8-12 


After  development  and  testing  of  this  code,  a  simulated  annealing  approach  is  planned  for  the  optimization  of  <t>. 
This  will  demonstrate  the  feasibility  and  utility  of  the  overall  Voronoi  tree  and  potential  field  methods. 

CONCLUSIONS  AND  RECOMMENDATIONS 

As  a  result  of  this  investigation,  there  are  a  number  of  conclusions  which  can  be  drawn  from  this  work. 
These  conclusions  are  represent  in  the  following: 

1.  Most  of  the  literature  discusses  path  planning  in  an  environment  with  obstacles  described  by  polytopes 
(polygons  in  2D  and  polyhedra  in  3D).  A  few  other  papers  discuss  path  planning  using  other  regular 
geometric  shapes  like  the  ellipse,  circle,  or  cone.  The  author  is  unaware  of  any  work  which  investigates  path 
planning  into  cavities  defined  by  splines. 

2.  Computational  speed  is  a  key  issue  in  the  benchmarking  of  path  planning  systems.  There  is  a  trade-off 
between  using  faster  methods  which  require  extensive  checking  and  rework  due  to  collisions  in  3D,  and  a 
method  which  may  take  longer  to  avoid  the  extensive  checking  and  rework.  Therefore,  computational  speed 
continues  to  be  an  important  item  on  the  research  agenda. 

3.  It  is  anticipated  that  solutions  will  be  sensitive  to  the  values  of  d  in  the  offset  curves  or  hodographs  (equations 
1  and  2),  and  to  the  values  of  K  and  w,  in  equation  (3).  Experience  with  the  method  should  be  the  best  guide 
to  generate  proper  solutions. 

4.  The  algorithms  described  here  for  Robot  path  planning  have  no  proof  of  convergence.  The  result  is  that  the 
algorithms  may  fail  under  certain  conditions.  It  is  expected  that  the  outcome  should,  however,  be  similar  to 
Hwang’s  (1992)  results  in  which  failure  occurs  over  a  very  limited  subset  of  problems. 

There  is  one  key  recommendation:  The  work  which  was  begun  through  this  study  should  continue.  This 
involves  additional  programming  and  program  testing  which  can  be  accomplished  in  the  coming  months.  As 
secondary  recommendations,  robot  dynamics/kinematics  should  be  considered  more  fully  in  future  approaches 
(Seereeram  &  Wen,  1995;  Xue  &  Sheu,  1989),  and  a  migration  to  a  continuous  path  approach  should  also  be 
considered  (Brady,  1982). 

The  research  outlined  here  uses  a  novel  approach  to  accomplish  path  planning  for  robot  arms  which  travel 
into  cavities.  It  is  believe  that  the  methods  will  prove  superior  to  existing  techniques,  and  improvements  will  drive 
down  inspection  costs.  This  will  make  greater  levels  of  reliability  needed  by  the  Air  Force  more  affordable. 
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ABSTRACT 


We  analyze  the  importance  of  current  crowding  in  a  new  cold  cathode  emitter  which  consists 
of  a  thin  wide  bandgap  semiconductor  material  sandwiched  between  a  metallic  or  heavily 
doped  semiconductor  and  a  low  work  function  semimetallic  thin  film.  Potential  material 
candidates  are  suggested  to  achieve  low- voltage  (<  10  V),  room-temperature  cold  cathode 
operation  with  emission  currents  of  several  tens  of  A/  cm2.  We  calculate  the  lateral  potential 
drop  which  occurs  across  the  emission  window  of  cold  cathodes  with  circular  geometry  and 
describe  its  effects  on  the  emitted  current  density  profile.  The  power  dissipation  in  the 
cold  cathode  is  calculated  as  a  function  of  a  dimensionless  parameter  characterizing  the 
importance  of  current  crowding.  We  determine  the  range  of  dc  bias  over  which  cold  cathodes 
of  different  radii  must  be  operated  to  minimize  current  crowding  and  self-heating  effects. 
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I.  INTRODUCTION 


Recently,  there  has  been  renewed  interest  into  cold  cathode  emitters  for  applications  to 
a  variety  of  electronic  devices,  including  microwave  vacuum  transistors  and  tubes,  pressure 
sensors,  thin  panel  displays,  high  temperature  and  radiation  tolerant  sensors,  among  others 
[1,  2].  Introduction  of  such  emitters  would  permit  an  unprecedented  compactness  and  weight 
reduction  in  device  and  equipment  design.  Low  temperature  operation  in  nonthermionic 
electron  emitters  is  very  desirable  for  keeping  the  statistical  energy  distribution  of  emitted 
electrons  as  narrow  as  possible,  to  minimize  thermal  drift  of  solid  state  device  characteristics, 
and  to  avoid  accelerated  thermal  aging  or  destruction  by  internal  mechanical  stress  and 
fatigue.  To  keep  the  emitter  temperature  rise  small  appears  easy  if  the  emitters  are  built  as 
thin  epitaxial  films  using  vertical  layering  technology  due  to  the  extremely  short  heatpaths 
and  excellent  heatsinking  possibilities  offered  with  this  architecture.  For  an  electron  emitter 
to  be  useful  in  microwave  tube  applications  it  should  be  capable  of  delivering  current  densities 
in  excess  of  10  A/  cm 2  and  to  sustain  emission  during  operational  lifetimes  over  periods  of 
10s  hrs.  To  satisfy  this  requirement,  the  structural  and  chemical  composition  must  be  stable. 
This  rules  out  the  historically  practiced  use  of  alkali  metal  films  on  emitter  surfaces  for  the 
lowering  of  electronic  work  functions.  These  films  sublimate,  evaporate  or  surface  migrate 
over  time  and  end  up  on  various  surfaces  inside  the  vacuum  envelop. 

Several  cold  cathode  emitters  have  been  proposed  since  their  first  successful  demon¬ 
stration  by  Williams  and  Simon  [3]  using  a  cesiated  p-type  GaP  structure.  A  review  and 
criticism  of  the  different  cold  cathode  approaches  was  given  recently  by  Akinwande  et  al. 
[4].  In  this  work,  we  propose  a  new  cold  cathode  emitter  concept  and  use  a  simple  model  to 
show  that  the  new  emitter  is  capable  of  achieving  low  voltage  (<  10  V)  room  temperature 
operation  with  emission  current  approaching  100  A/cm2  and  large  efficiencies.  A  prelimi¬ 
nary  report  of  this  work  has  been  published  earlier  [5].  The  architecture  of  the  structure  is 
shown  in  Fig.  1.  The  main  elements  in  the  design  and  functioning  of  such  an  emitter  are 
:  (1)  a  wide  bandgap  semiconductor  slab  equipped  on  one  side  with  a  metallic  contact  [6] 
or  a  heavily  doped  semiconductor  (n++  —  InP )  on  one  side  of  an  undoped  CdS  region  that 
supplies  electrons  at  a  sufficient  rate  into  the  conduction  band  and  (2)  on  the  opposite  side, 
a  thin  semimetallic  film  that  facilitates  the  coherent  transport  (tunneling)  of  electrons  from 
the  semiconductor  conduction  band  into  vacuum.  Of  importance  is  the  mutual  alignment 
of  the  crystalline  energy  levels  at  the  semiconductor-semimetal  film  junction.  This  requires 
the  use  of  new  materials  and  development  of  their  epitaxial  growth  technologies.  For  that 
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reason,  the  choice  of  InP  as  a  substrate  is  particularly  attractive  since  the  lattice  constant 
of  InP  (5.S6  A)  closely  matches  the  lattice  constant  of  the  zincblende  cubic  CdS  (5.83  A). 
Furthermore,  there  have  been  recent  reports  on  the  deposition  of  crystalline  layers  of  CdS  on 
InP  by  molecular  beam  epitaxy  [7],  chemical  bath  deposition  [8],  and  pulsed  laser  deposition 
[9].  The  proposed  cold  cathode  should  therefore  be  realizable  with  present  day  technology. 

As  shown  in  Fig.  1(a),  a  thick  metal  grid  is  defined  on  the  surface  of  the  LaS  thin 
film  to  bias  the  structure.  There  are  openings  in  the  grid  structure  to  expose  the  thin  LaS 
film  which  forms  the  active  emission  area  of  the  cold  cathode.  Cathodes  with  rectangular 
(Fig.l(b))  emission  windows  were  studied  previously  [5].  Current  crowding  and  self-heating 
effects  in  cathodes  with  circular  geometry  (Fig. 1(c))  emission  windows  will  be  considered 
hereafter.  The  bias  is  applied  between  the  back  metallic  contact  and  the  metal  grid  with 
emission  occuring  from  the  exposed  LaS  surface.  If  the  applied  voltage  is  equal  or  larger 
than  the  semiconductor  bandgap  energy  and  the  quotient  of  the  applied  voltage  divided  by 
the  semiconductor  thickness  approaches  O.leV/A,  then  electrons  are  tunnel  injected  into  the 
conduction  band  and  ascend  during  their  travel  across  the  semiconductor  film  to  levels  of 
increasing  energy.  Referring  to  Fig.  2,  the  conduction  band  of  the  wide  bandgap  semicon¬ 
ductor  provides  the  launching  site  for  electrons  where  they  are  -  through  a  thin  film  -  injected 
into  vacuum.  This  injection  of  electrons  into  vacuum  becomes  possible  and  is  effective  as 
long  as  the  semimetallic  film  is  very  thin  and  has  a  work  function  small  enough  so  that  its 
vacuum  edge  is  located  energetically  below  the  conduction  band  edge  of  the  semiconductor. 
This  situation  is  referred  to  as  negative  electron  affinity  (NEA)  for  the  semiconductor  ma¬ 
terial  [10].  Depending  on  the  particular  materials  choices,  this  implies  that  the  semimetal 
work  function  <f>M  in  relation  to  the  semiconductor  energy  bandgap  Eg  must  obey  one  of  the 
inequalities  <}>m  <  0 ,5Ea  or  <j>M  <  Eg  if  an  intrinsic  or  p-type  doped  wide  bandgap  semi¬ 
conductor  is  used,  respectively.  A  negative  <f>M  implies  according  to  Fig.2  that  the  vacuum 
level  would  be  located  below  the  lower  conduction  band  edge.  In  that  case,  electrons  in 
the  conduction  band  with  momenta  pointing  toward  the  surface  have  a  good  chance  to  get 
emitted  unless  deflected  by  collision  or  trapped  by  impurities  or  defects. 

This  paper  is  organized  as  follows.  In  section  II,  we  derive  the  basic  equations  describing 
the  forward  bias  operation  of  the  cold  cathode  emitter  described  above.  We  then  calculate 
the  current  density- voltage  characteristics  of  the  newly  proposed  cold  cathode  for  specific  sets 
of  materials  and  device  parameters.  In  section  III,  we  investigate  the  importance  of  current 
crowding  effects  in  various  cold  cathodes  with  circular  geometry.  Our  analysis  includes  a 


9-4 


self-consistent  modeling  of  current  crowding  effects  and  an  analysis  of  power  dissipation  in 
the  cold  cathode  active  area.  The  influence  of  power  dissipation  on  self-heating  effects  in  the 
active  area  of  the  cathode  is  also  described.  Finally,  Sec.  IV  contains  our  conclusions. 

II.  THE  MODEL 

Hereafter,  we  analyze  the  cold  cathode  whose  energy  band  diagram  is  shown  in  Fig.  2 
[5],  Under  the  influence  of  a  large  electric  field  in  the  wide  bandgap  semiconductor,  electrons 
will  eventually  tunnel  from  the  left  contact  through  the  barrier  at  the  metal-semiconductor 
interface.  A  portion  of  the  current  emitted  at  the  metal  or  heavily  doped  semiconductor 
- CdS  contact  (which  we  model  assuming  Fowler- Nor dheim  injection)  is  transmitted  at  the 
boundary  of  the  LaS  as  well  as  the  vacuum  boundary.  However,  a  fraction  of  the  current  is 
lost  in  the  thin  LaS  quantum  well  gives  rise  to  the  dynamic  shift  of  the  effective  material 
work  function  (Fig.  2).  For  a  cathode  operated  at  room  temperature,  we  model  this  internal 
field  emission  at  the  injection  junction  using  a  Fowler-Nordheim  (FN)  type  expression  for 
the  injected  current  (in  A jcrn?)  [11] 

Jfn  =  Ci(E2/A)ec'lA3/2/E,  (1) 

where  C\  and  C2  are  constants  which  depend  on  the  wide  bandgap  semiconductor.  In  our 
numerical  simulations,  we  chose  C\  —  1.5AT106  A/V  and  C2  —  6.9JV10 7(VV2cm)-1  which 
are  of  the  same  order  of  magnitude  as  the  constants  appearing  in  the  FN  expression  [11]. 
In  Eq.(l),  A  is  the  barrier  height  (in  eV)  at  the  metal-semiconductor  junction  and  E  is 
the  electric  field  (in  V/cm)  in  the  wide  bandgap  semiconductor  [12].  We  assume  that  the 
semiconductor  layer  thickness  is  such  that  the  transport  of  injected  electrons  is  close  to  being 
ballistic  up  to  the  interface  between  the  semiconductor  and  the  thin  semimetallic  film.  In  so 
doing,  we  also  neglect  carrier  ionization  processes  in  the  semiconductor  slab  which  could  be 
the  main  antagonist  to  ballistic  transport  in  that  region. 

Because  of  the  finite  probabilities  for  the  injected  current  to  be  transmitted  at  the 
semiconductor-semimetal  (probability  7i)  and  semimetal- vacuum  interfaces  (probability  T2), 
the  contributions  to  the  total  emitted  current  can  be  calculated  as  the  sum  of  the  contribu¬ 
tions  resulting  from  the  mutiple  reflections  of  electrons  in  the  semimetallic  layer  (See  Fig.  2). 
The  magnitudes  of  the  emitted  current  components  decreases  with  the  number  of  multiple 
reflections  in  the  semimetallic  layer.  Rather  than  trying  to  calculate  these  contributions 
exactly,  we  assume  that  the  current  amplitude  is  decreased  by  a  factor  e  =  exp(— T2/ -Vus) 
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for  each  traversal  of  the  semimetallic  layer,  where  Xus  is  the  collisional  mean  free  path  in 
the  semimetallic  layer  and  L2  is  the  length  of  the  semimetallic  layer.  Adding  the  contribu¬ 
tions  resulting  from  multiple  crossings  of  the  semimetallic  layers,  the  total  emitted  current 
is  found  to  be 

Jem  =  €.T\T2Jq(\  +  x  +  x2  +  ...),  (2) 

where  x  —  e2(l  —  Ti)(l  —  T2).  In  calculating  Jem  we  limited  the  number  of  traversals  of 
the  semimetallic  slab  to  five  to  include  the  fact  that  electrons  loose  energy  in  each  crossing 
and  eventually  do  not  have  enough  energy  to  surmount  the  barrier  at  the  semimetal- vacuum 
interface.  According  to  Eq.(2),  the  contributions  from  the  multiple  reflections  decrease 
rapidly  since,  in  general,  the  quantity  x  will  be  much  smaller  than  unity  [13].  Once  the 
emitted  current  is  found,  the  total  current  contributing  to  the  increase  in  the  sheet  carrier 
concentration  in  the  thin  semimetallic  film  can  easily  be  written  as  Jcapt  =  Jfn  —  Jem.  The 
total  trapped  current  is  then  given  by 

~  AJRt JfN  (3) 

where  A  is  the  area  of  each  LaS  emission  window  which  in  practice  can  be  either  rectangular 
or  circular  (See  Fig.  lb  and  lc).  In  Eq.(3),  R  is  the  trapping  coefficient  of  the  well 

R  =  1  -  eTj T2  (1  +  x  +  x2).  (4) 

The  semimetallic  thin  film  can  be  modeled  as  a  quantum  well  (Q.W)  which  will  loose  the 
trapped  electrons  essentially  at  its  lateral  boundaries.  In  reality,  Fig.  1(a)  indicates  that 
not  all  electrons  will  move  to  the  three-dimensional  contact  regions  surrounding  the  thin 
semimetallic  layer  but  many  of  them  will  get  reflected  at  the  lateral  thin  film  layer  with 
an  average  probability  r  (calculated  for  electrons  with  the  Fermi  velocity  in  the  thin  film). 
The  exiting  number  of  electrons  will  depend  on  the  thickness  of  the  semimetallic  layer  and 
could  be  adjusted  by  intentional  passivation  so  that  reflection  at  the  boundaries  of  the  thin 
semimetallic  film  could  be  tuned  from  almost  zero  to  nearly  unity.  Taking  into  account  the 
finite  reflection  amplitude  at  the  thin  film  boundaries,  the  leakage  current  of  the  Q.W  can 
be  rewritten 

=  2eIiV2£)UF(l  -  r),  (5) 

for  the  case  of  a  rectangular  emission  window  and 

dQr 

=  2-rreaN2DVF(l  -  r),  (6) 
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for  the  case  of  a  circular  emission  window. 

In  Eqns.(5)  and  (6),  Qt  is  the  total  charge  captured  by  the  well,  e  is  the  magnitude  of 
the  electronic  charge,  N2d  is  the  excess  sheet  carrier  concentration  in  the  thin  film  due  to 
the  captured  electrons,  and  vF  is  the  Fermi  electron  velocity  in  the  semimetallic  thin  film. 
Under  steady  state  operation  of  the  cold  cathode,  the  excess  charge  in  the  two-dimensional 
semimetallic  film  is  found  using  Eq.(3)  and  imposing  the  current  balance  requirement  ^  = 
ir  —  AJcapt ■  This  leads  to 

N2D  =  W  Jcapt/2e(l  —  r)vF,  (7) 

for  the  case  of  a  rectangular  geometry  and 

N2d  =  aJcapt/ 2e(l  -  r)vF,  (8) 

for  the  case  of  a  circular  geometry. 

Simultaneously,  the  change  N2d  in  the  excess  sheet  carrier  concentration  in  the  Q.W 
due  to  trapped  electrons  leads  to  the  occupation  of  the  boundstate  energy  levels  according  to 
the  energy  density  of  states  up  to  an  energy  level  which  will  establish  the  dynamic  Fermi  level 
Ei?1.  The  Fermi  velocity  vF  entering  Eqns.(5)  and  (6)  must  be  calculated  self-consistently 
because  of  the  dynamic  work  function  shift  jAxI  illustrated  in  Fig.  2.  This  dynamical  shift 
|Ax|  is  equal  to  \EFl  -  EF°\,  where  EF°  is  the  Fermi  level  in  the  thin  semimetallic  layer 
under  zero  bias.  For  simplicity,  we  assume  that  the  electrons  in  the  conduction  band  of  the 
semimetallic  films  can  be  described  using  the  Sommerfeld  theory  of  metals  while  assuming 
s-band  conduction  in  the  semimetallic  thin  film  and  while  modeling  the  thin  film  using  the 
particle  in  a  box  model  for  the  quantum  well  [14].  The  set  of  equations  (1-8)  is  then  solved 
self-consistently  to  calculate  the  work  function  shift  ]Ax|  as  a  junction  of  the  externally 
applied  bias.  Once  the  dynamic  shift  has  been  determined  self-consistently,  Eq.(2)  can  then 
be  used  to  determine  the  emitted  current. 

RESULTS 

We  consider  a  specific  structure  with  the  material  and  structural  parameters  listed  in 
Table  I  and  II,  respectively.  Both  Au  and  Ag  are  known  to  form  contacts  to  thin  films  of 
semiconducting  (n-type)  CdS.  In  that  case,  the  barrier  height  A  shown  in  Fig.  2  is  quite 
small  and  is  equal  to  0.7S  eV  and  0.56  eV  for  the  case  of  Au  and  Ag  contacts,  respectively 
[11].  The  lattice  constant  of  CdS  (5.S3A)  is  very  close  to  the  lattice  constant  of  the  thin 
semimetallic  surface  layer  LaS  (5.85  A)  which  in  its  cubic  crystalline  structure  will  therefore 
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be  lattice  matched  to  the  semiconducting  material.  Additionally,  LaS  is  expected  to  have 
quite  a  low  room  temperature  work  function  (1.14  eV)  [15],  a  feature  when  combined  with 
the  large  energy  gap  (2.5  eV)  of  CdS  leads  to  NEA  of  the  semiconductor  material.  In  the 
following  numerical  simulations,  the  thicknesses  of  the  CdS  ( L\ )  and  LaS  ( L 2)  layers  are  set 
equal  to  500  A  and  24.6  A  (4  monolayers),  respectively.  We  model  a  cathode  with  a  square 
(W  =  L)  emission  window  with  a  1  cm2  area. 

Figure  3  is  a  plot  of  the  dynamic  work  function  shift  as  a  function  of  applied  bias  for 
the  cold  cathode  emitter  with  both  Au  and  Ag  injecting  contacts.  The  following  parameters 
were  used:  A  ms  =  300A,  T\  =  Ti  —  0.5,  and  vf  =  1.36-X108  cm/s.  Figure  3  indicates  that 
the  dynamic  shift  of  the  LaS  work  function  is  sensitive  to  the  quality  of  the  interface  between 
the  two-dimensional  semimetallic  layer  and  the  three-dimensional  contacts  which  we  model 
by  varying  the  reflection  coefficient  r  between  the  two-dimensional  semimetallic  thin  film 
and  the  three-dimensional  contact  regions  (See  Fig.  1(a)).  It  should  be  noticed  that  the  LaS 
work  function  shift  can  approach  the  LaS  workfunction  even  for  the  case  of  a  leaky  interface 
between  the  thin  semimetallic  layer  and  the  3D  contact  regions.  The  dynamic  shift  |Ax|  is 
comparable  to  the  work  function  of  LaS  for  a  smaller  value  of  the  applied  bias  in  the  case  of 
Ag  contact  because  of  the  lower  barrier  at  the  Ag/ CdS  interface. 

Figure  4  compares  the  emitted  current  densities  Jm  for  the  structure  with  Au  and  Ag 
contacts  calculated  while  including  or  neglecting  the  effects  of  the  dynamic  shift  of  the  LaS 
work  function.  The  current  density  versus  bias  plots  are  stopped  at  the  values  of  Vbia3  at 
which  |Ay|  =  4>M(LaS)  =  1.14eV.  Beyond  that  point,  the  theory  exposed  here  is  no  longer 
valid  since  we  would  need  to  include  the  spill  over  of  the  excess  trapped  carriers  into  vacuum. 
As  can  be  seen  in  Fig.  4,  the  emitted  current  densities  can  be  more  than  a  factor  two  larger 
when  the  effects  of  the  dynamic  shift  of  the  work  function  of  the  semimetal  are  included. 
The  effects  could  be  made  more  drastic  if  a  set  of  materials  and  device  parameters  could  be 
found  for  which  the  dynamic  shift  of  the  work  function  could  be  made  comparable  to  the 
work  function  itself  at  fairly  low  value  of  the  applied  bias  (<  5F). 

Sensitivity  of  Dynamic  Work  Function  Shift  on  Design  Parameters 

The  previous  numerical  examples  have  shown  that,  under  forward  bias  operation,  the 
electrons  captured  in  the  low  work  function  material  are  responsible  for  an  effective  reduction 
of  the  semimetallic  film  work  function  together  with  a  substantial  increase  of  the  cathode 
emitted  current.  This  dynamic  work  function  shift  was  shown  to  increase  with  the  amount  of 
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injected  current.  Hereafter,  we  perform  a  more  extensive  study  of  the  dynamic  work  function 
shift  which  includes  variations  of  the  length  of  the  CdS  region  (Iri),  the  electron  mean  free 
path  in  the  LaS  region  (A us),  the  emission  window  size  (W),  the  transmission  coefficients 
at  the  CdS/ LaS  (Ti)  and  LaS /V acuum  interfaces  (T2),  and  the  reflection  at  the  2D/3D 
interface  region  in  the  LaS  quantum  well  (r). 

Figure  3  indicates  that  the  dynamic  work  function  shift  |  Ax)  rises  exponentially  above 
a  threshold  voltage  of  several  volts  and  reaches  rapidly  (within  a  few  volts  range)  a  value 
comparable  to  the  LaS  work  function.  For  a  structure  with  the  parameters  listed  in  Table  I 
and  with  the  structural  and  physical  parameters  ( L\  =  500  A,  L2  =  24.6A,  W  =  1cm,  A ias  — 
300A,  Ti  =T2  =  0.5,  A(AS)  =  0.56eV  ),  Fig.  5(a)  shows  that  the  dynamic  work  function 
shift  rises  exponentially  at  a  lower  bias  as  the  reflection  coefficient  at  the  2D/3D  interface 
in  the  LaS  region  approaches  unity.  Figure  5(b)  also  shows  that  the  difference  between 
the  current  densities  calculated  with  and  without  including  the  dynamic  work  function  shift 
are  more  pronounced  for  smaller  values  of  the  applied  bias  when  the  reflection  coefficient 
between  the  2D  and  3D  LaS  regions  is  approaching  unity.  This  results  from  the  fact  that 
any  mechanism  (like  r  being  closer  to  unity)  which  increases  the  amount  of  charge  being 
trapped  in  the  LaS  quantum  well  leads  to  an  enhancement  of  the  dynamic  work  function 
shift  at  a  given  bias.  For  instance,  all  other  cathode  parameters  being  equal,  the  dynamic 
work  function  rises  much  faster  as  a  function  of  applied  bias  in  structures  with  thinner  CdS 
regions  (Fig.  6),  with  smaller  values  of  the  transmission  coefficients  T\  and  T2  (Fig.  7),  or 
with  smaller  mean  free  path  (A£as)  in  the  semimetallic  thin  film  (Fig.  8).  As  shown  in  Fig. 
8(a),  the  dynamic  work  function  shift  occurs  at  a  lower  bias  as  A ias  is  decreased.  On  the 
other  hand,  the  emitted  current  density  is  lesser  at  a  given  bias  in  a  cathode  whose  LaS  thin 
film  has  a  lower  electron  mean  free  path  (Fig.  8(b)).  We  have  found  this  trend  to  be  valid 
for  all  values  of  the  reflection  coefficient  at  the  LaS  2D/3D  interface.  However,  there  is  a 
larger  spread  in  the  family  of  curves  representing  the  bias  dependence  of  the  dynamic  work 
function  and  emitted  current  densities  as  a  function  of  the  mean  free  path  in  the  LaS  thin 
film  for  smaller  values  of  the  reflection  coefficient  r.  Finally,  even  though  not  shown  here, 
the  exponential  rise  of  the  dynamic  work  function  shift  and  emitted  current  density  has  been 
shown  to  occur  at  lower  value  of  the  bias  by  either  reducing  the  thickness  of  the  CdS  layer 
or  by  lowering  the  barrier  height  A  at  the  metal /CdS  interface. 

Before  leaving  this  section,  we  make  two  additional  remarks  on  the  bias  dependence 
of  the  emitted  current  density  and  the  dynamic  work  function  shift  which  we  have  checked 
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numerically  on  all  the  cold  cathodes  modelled  in  this  work.  First,  since  the  emission  of 
electrons  at  the  metal(or  heavily  semiconductor)/CdS  interface  is  assumed  to  be  of  the 
Fowler-Nordheim  type,  the  emission  current  is  expected  to  have  a  bias  dependence  of  the 
form 

Jfn  =  AVbia32exp(-B /Vbias)-  (9) 

This  was  checked  numerically  for  all  the  cold  cathodes  modelled  here.  A  typical  example  is 
given  in  Fig.9(a),  with  the  values  of  the  parameters  A  and  B  listed  in  the  inset.  Most  of 
the  quantities  of  interest  to  be  determined  hereafter  (power  dissipation,  lateral  variation  of 
emitted  current  and  lateral  potential  drop,...)  can  be  calculated  exactly  analytically  if  the 
following  approximation  is  used 

Jfn  =  JoeaVbias.  (10) 

For  all  the  cold  cathodes  simulated  here,  we  have  shown  that  Eq.(10)  gives  a  fairly  accurate 
fit  to  the  plot  of  the  emitted  current  density  versus  applied  bias  if  the  range  of  the  fit  is 
restricted  to  current  densities  between  1  and  1000  A/ cm2  .  A  typical  fit  for  the  one  of  the 
cold  cathode  studied  here  is  shown  in  Fig.9(b)  with  the  values  of  the  parameters  J0  and  a 
in  Eq.(8)  shown  in  the  inset.  Table  III  gives  a  summary  of  the  parameters  J0  and  a  for  cold 
cathodes  of  different  width  and  with  the  physical  parameters  listed  in  the  caption  of  Fig.5. 
We  have  found  that  the  bias  dependence  of  the  dynamic  work  function  shift  is  also  of  the 
Fowler-Nordheim  type,  i.e, 


Ax  =  A0Hias2exp(— Vo/Kas).  (11) 

This  is  illustrated  in  Fig.  10  for  the  cold  cathode  with  the  same  parameters  as  in  Fig.9.  The 
values  of  Ao  and  Vo  a-re  indicated  in  the  inset  of  Fig.  10.  We  point  out  that  the  values  of  the 
parameters  B  and  Vq  in  Eqns.(9)  and  (10)  are  nearly  identical. 

Temperature  Rise  in  the  Cold  Cathode 

Hereafter,  we  derive  an  upper  estimate  of  the  temperature  rise  in  the  LaS  thin  film  as  a 
result  of  the  power  dissipation  mechanisms  discussed  above.  We  focuss  on  a  cold  cathode 
where  the  CdS  thin  film  is  deposited  on  a  InP  substrate  as  shown  in  Fig.l(a).  The  successful 
growth  of  cubic  CdS  thin  films  with  good  crystalline  quality  on  InP  substrates  has  been 
reported  recently  by  Shen  and  Kwok  [9].  For  the  case  of  a  InP/CdS  interface,  there  has  not 
been  any  report  of  the  conduction  band  discontinuity  A Ec  at  the  interface  between  the  two 
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materials,  to  the  best  of  our  knowledge.  For  that  reason,  A Ec  was  assumed  to  be  given  by 
Anderson’s  rule,  i.e,  A  in  Fig.  2  is  assumed  to  be  given  by  \x(InP)  —  x(C'd5’)|  =  0.2  eV 
[16].  This  estimate  was  based  on  the  measured  electron  affinities  of  InP  (4.4  eV)  and  CdS 
(4.2  eV)  reported  in  refs.  [17]  and  [IS],  respectively.  The  back  contact  to  the  substrate  is 
assumed  to  be  perfectly  ohmic  and  to  act  as  a  perfect  heat  sink  (300K).  Since  the  device 
area  (heat  source  formed  of  the  LaS  thin  fim)  is  much  thinner  than  the  substrate,  it  is 
necessary  to  consider  the  effect  of  heat  spreading  laterally  in  the  substrate.  Furthermore, 
we  assume  that  the  temperature  of  the  CdS  layer  will  be  the  same  as  the  LaS  top  layer. 
The  active  area  of  the  cathode  (CdS  and  LaS  layers)  is  therefore  assumed  to  be  acting  as  a 
heat  source  with  the  power  density  calculated  in  the  previous  section.  Because  of  the  finite 
thermal  conductivity  of  the  substrate,  we  expect  self-heating  effects  to  affect  the  operation 
of  the  cold  cathode  if  the  power  level  dissipated  in  the  active  area  of  the  cathode  becomes 
too  important.  Hereafter,  we  model  the  thermal  conductivity  of  the  InP  substrate  as  follows 

k(T)  =  *o(T/Ta)-\  (12) 


where  k0  is  the  thermal  conductivity  at  To  (300K),  n0  =  0.74  W /Kcm  is  the  room  temperature 
thermal  conductivity  of  InP  and  b=1.45.  Starting  with  Fick’s  law  and  making  use  of  a 
Kirchoff  transformation  to  take  into  account  the  temperature  dependence  of  the  thermal 
conductivity  of  the  InP  substrate  given  by  Eq.(l2),  it  can  be  shown  that  the  active  area  of 
the  cold  cathode  will  be  operated  at  a  temperature  given  by 


1 
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(13) 


where  To  is  the  ambient  room  temperature  (assumed  to  be  300  K  hereafter),  P&M  is  the  total 
power  dissipated  per  finger  as  calculated  in  the  previous  section,  and 


ZS 


dz 

Wr 


(14) 


where  Zs  is  the  thickness  of  the  InP  substrate.  Our  estimate  of  the  temperature  rise  in  the 
active  cold  cathode  area  will  give  an  upper  estimate  of  the  temperature  of  operation  since 
we  neglected  heat  conduction  to  the  top  Au  contacts  in  the  thick  portion  of  the  LaS  thin 
film. 
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III.  Current  Crowding  Effects  in  Proposed  Cold 
Cathode  with  a  Circular  Geometry 


In  this  section,  we  study  the  effects  of  current  crowding  in  the  case  of  an  emission  window 
with  circular  geometry  (see  Fig.l(c)).  The  lateral  potential  drop  in  the  circular  LaS  window 
satisfies  the  following  differential  equation  [19] 

f  =  7!'M-  W> 

where  i(r)  is  the  total  lateral  current  per  unit  length  flowing  outward  across  a  circle  of  radius 
r,  whose  center  coincide  with  the  center  of  the  emission  window.  The  lateral  current  satisfies 
the  following  equation 

27rn(r)  =  xR  f  dr'r'j(r).  (16) 

Jo 

If  we  further  assume  that  the  Fowler-Nordheim  emitted  current  j(r)  can  be  approximated 
by  Eq.(10)  over  the  range  of  dc  bias  considered  here,  the  following  second-order  differential 
equation  must  be  satisfied  by  the  lateral  potential  drop: 

(17) 

ar2  r  dr  t 


This  differential  equation  must  be  solved  subject  to  the  following  boundary  conditions  valid 
for  the  circular  geometry 

dV 

*-o.  -  <18) 


at  r  =  0,  and 

V{r  =  a)  =  Vbia3,  (19) 

at  the  edge  of  the  circular  window.  Introducing  the  reduced  variable  r  =  r/a  and  the 
quantity 

v  V  —  Vbia3 


Eq.(17)  can  be  recast  as  follows 


<PY  IdY  n  2  Y 

dr 12  +  r'  dr'  "  27  e  ’ 


where  72  =  and  the  parameter  (3  is  identical  to  the  one  defined  for  the  planar  problem 

[20]. 
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The  general  solution  of  Eq.(21)  can  be  found  analytically  and  is  given  by 

^  =  (c5V7!)[r'i'2/(l  -  *'S)\  (22) 


where  6  and  c  are  constants  to  be  determined  so  the  boundary  conditions  (18)  and  (19)  are 
satisfied.  Using  the  new  system  of  variables,  Eq.(18)  becomes 


o 

II 

o 

Mi 

-a  I’-a 

(23) 

while  Eq.(19)  now  reads 

K(1)  =  0. 

(24) 

Equation  (23)  leads  to 

6  =  2. 

(25) 

and  Equation  (24)  becomes 

s2  =  (1  -  c)W!c. 

(26) 

Combining  these  last  two  equations,  we  obtain  the  following  result 

2  4C 

7  "[1  -cp’ 

(27) 

Equation  (27)  can  be  solved  exactly  for  the  parameter  c 

c  =  [72  +  2-2^/72  +  l]/72 

(28) 

in  terms  of  which  we  can  write  vaxious  quantities  of  interest,  including  the  ratio 

J(0)/J(a)  =  [1  -  c]2, 

(29) 

characterizing  the  importance  of  current  crowding  in  the  circular  geometry.  Using  Eqn.(22), 
the  radial  dependence  of  the  lateral  potential  drop  if  found  to  be 

V(r)  =  Vbias  -  2 VTln[al~-~],  (30) 

a£(l  —  c) 

from  which  the  maximum  value  of  the  in  plane  electric  field  is  found  to  be 


Er(r  =  a)  = 


4c  Vj 
c  —  1  a 


(31) 


As  in  the  case  of  the  rectangular  geometry,  the  total  power  dissipated  in  the  LaS  thin 
film  is  given  by  the  sum  of  the  following  four  contributions  [20].  The  power  dissipated  by 
the  electrons  being  trapped  in  the  LaS  circular  thin  film  is  given  by 


Pi  =  2 tR  [  drj(r)V(r). 
Jo 
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The  power  dissipated  by  Joule  heating  as  trapped  electrons  move  to  the  edge  of  the  LaS 
window  is  given  by 

p2 — —lrlLr3{r)dr  i-  <33) 

The  third  contribution  to  power  dissipation  comes  from  from  Joule  heating  linked  to  the 
current  making  it  from  the  LaS  thin  film  to  the  Au  contacts  on  top  of  the  thick  LaS 
regions. 

P3  =  Rc(2irai(a))2.  (34) 

where  Rc  is  the  resistance  of  the  LaS  region  between  the  edge  of  the  LaS  thin  film  and  the 
top  Au  layer.  This  resistance  can  be  estimated  as  follows  [21] 


Rc  = 


psH  Ink 
t2xa  k  —  1  ’ 


where  k  =  b/t,  a  is  the  radius  of  the  circular  window,  and  H  is  the  height  of  the  thick  LaS 
region. 

Finally,  there  is  also  a  contribution  to  power  dissipation  due  to  the  blocking  effect  on 
the  Folwer-Nordheim  emission  current  emitted  under  the  wide  LaS  contacts: 

P4  =  Tb(b  +  2a)J0eaV>^\  (36) 

Starting  with  Eqns.(lO)  and  (30),  the  different  contributions  to  the  power  dissipation 
can  be  calculated  exactly  and  are  found  to  be 

Pi  =  ira2J0Re“Vb"'Vbias(l  -  c)  -  ira2J0ReaV^VT[, 2(1  -  c)  +  ln(l.  -  c)2],  (37) 

c 

P2  =  R{ira2  J0eaV»*’)VT[2{l  -  c)  +  2— ^/n(  1  -  c)2],  (38) 

c 

P3  is  given  by  Eq.(34)  and  P4  is  found  to  be 

RCR 2  2 

~  2(1  -Rfiem  ' 

where  is  the  total  emitted  current  through  the  circular  window 

Itm  =  2tt(1  -  R)  rj(r)dr.  (40) 

The  latter  can  be  calculated  explicitely  and  is  found  to  be 

Iem  =  (1  -  R) [tt a2  JoeaVbia,\(  1  -  c).  (41) 
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The  input  power  (per  emission  window)  delivered  by  the  power  supply  biasing  the  cold 
cathode  is  given  by 

f  a+6 

Pinput  =  /  2  irrj(r)V{r)dr,  (42) 

Jo 

which  can  readily  be  shown  to  be  given  by 

Pinput  —  Pi/R  +  Pa-  (43) 

The  power  efficiency  of  the  cold  cathode  can  be  calcualted  as  follows 


T}P  = 


Pinput  P Has 


and  the  temperature  rise  in  the  cathode  is  given  by  Eq.(13).  where  the  thermal  resistance 
Rth,o  must  be  calculated  for  the  case  of  power  dissipation  through  the  substrate  from  a  heat 
source  with  circular  geometry.  In  this  case,  we  find 

R„.o  =  Ty  (48) 

«o  a  [a  +  zstanv\ 

in  which  the  heat  spreading  angle  &  is  set  equal  to  45°  in  the  numerical  examples  below,  for 
simplicity. 

Numerical  Examples 


Figure  11  shows  the  variation  of  the  parameter  c  in  Eq.(28)  as  a  function  of  applied 
bias  for  cold  cathodes  with  circular  emission  window  of  different  radii.  For  all  cathodes, 
the  physical  parameters  are  the  same  as  listed  in  the  caption  of  Fig.5.  As  in  the  case  of  a 
rectangular  window,  we  use  the  criterion  that  current  crowding  is  negligible  if  the  lateral 
potential  drop  between  the  center  and  the  edge  of  the  LaS  circular  window,  V (r  =  0)  -  Vbias, 
is  kept  less  than  O.lVr-  Using  Eqns.(2S)  and  (30),  we  find  that  this  criterion  requires  the 
parameter  c  to  be  less  than  0.05.  This  limit  is  indicated  as  a  vertical  line  in  Fig.  11.  The 
family  of  curves  in  Fig.  11  is  parametrized  with  the  radius  of  the  emission  window.  Figure  11 
shows  that  the  range  of  dc  bias  over  which  current  crowding  can  be  neglected  in  a  circular 
window  is  comparable  to  the  range  of  dc  bias  over  which  current  crowding  is  negligible  in  a 
rectangular  window  whose  width  is  equal  to  the  radius  of  the  circular  emission  window  [20]. 

Figure  12  illustrates  the  importance  of  current  crowding  on  the  lateral  potential  drop 
in  emitter  windows  of  different  radii.  The  left  frames  show  the  radial  dependence  of  the 
electrostatic  potential  for  four  different  values  (1,10,100,1000  A/cm2)  of  the  current  density  at 
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the  rim  of  the  circular  LaS  window.  The  right  frames  in  Fig.  12  show  the  corresponding  radial 
dependence  of  the  emitted  current  density.  From  Fig.  12,  it  can  be  seen  that  current  crowding 
is  negligible  in  emitter  windows  with  radius  less  than  50  ptm  if  the  emitted  current  density 
is  kept  under  10  A/ cm2.  As  in  the  case  of  LaS  windows  with  rectangular  geometry,  Figure 
12  shows  that  the  current  density  profiles  are  much  more  sensitive  to  the  finite  resistivity  of 
the  LaS  thin  film  than  the  lateral  potential  drop. 

Figure  13  is  a  plot  of  the  four  contributions  to  the  total  power  dissipated  in  cold 
cathodes  with  different  radii  plotted  as  a  function  of  the  parameter  c.  For  all  cathodes,  the 
power  dissipation  due  to  Joule  heating  in  the  LaS  thin  film  and  the  thick  LaS  regions  is 
negligible  compare  to  the  power  released  by  electrons  being  trapped  in  the  LaS  thin  film  and 
by  electrons  blocked  in  the  thick  LaS  regions.  The  latter  is  always  about  one  of  magnitude 
higher  than  the  former.  Figure  13  shows  that  substantial  power  dissipation  occurs  in  the 
cathode  with  radius  under  50 ptm  while  the  cathode  is  still  operating  without  any  substantial 
current  crowding  effects  (i.e,  c  <  0.05).  The  power  efficiency  rjp  of  cathodes  of  different 
width  is  plotted  as  a  function  of  Vbias  and  the  emitted  current  density  in  Figs.  14(a) 
and  14(b),  respectively.  For  all  window  size,  the  efficiency  decreases  with  K»0a  and  Jem  as  a 
result  of  current  crowding.  The  efficiency  is  more  or  less  constant  over  a  wider  range  of  Vbias 
for  window  with  smaller  radius  because  current  crowding  is  less  important  in  that  case,  as 
illustrated  in  Fig.  12.  The  overall  lower  efficiency  for  window  with  smaller  radius  illustrated 
in  Fig.  14  comes  from  the  fact  that  the  width  of  the  thick  LaS  regions  was  set  equal  to 
100 fim  for  all  cathodes.  The  efficiency  of  cathodes  could  be  increased  by  making  the  ratio 
b/a  in  Fig.  1(b)  closer  to  unity. 

Figure  15  shows  the  temperature  of  the  active  area  of  a  cold  cathode  with  the  param¬ 
eters  listed  in  Table  I  as  a  function  of  Vbia3  for  emitter  window  with  different  radii.  The 
thickness  of  the  LaS  contacts  and  InP  substrate  was  set  equal  to  100A  and  100/xm,  respec¬ 
tively.  As  in  case  of  cold  cathodes  whose  emission  window  as  a  rectangular  geometry  [20], 
Fig.  15  indicates  that  to  limit  the  temperature  rise  in  any  cathode  to  less  than  200  K,  the 
dc  bias  must  be  limited  to  a  smaller  range  for  emitters  with  smaller  window  radius.  For 
instance,  according  to  Figures  11  and  15,  a  cathode  with  a  20 \im  diameter  can  be  operated 
up  to  8.3  V  with  negligible  self-heating  effects  (AT  around  100  K  ).  For  that  bias,  Figure  12 
indicates  that  current  crowding  would  be  negligible  in  the  cathode  and  the  emitted  current 
density  would  be  around  100  A/cm2  (See  Fig.  16).  On  the  other  hand,  Figure  11  indicates 
that  a  lOO^rn  diameter  window  can  be  operated  up  to  6.8  V  before  current  crowding  becomes 
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non  negligible.  At  this  bias,  the  emitted  current  density  would  be  around  15  A/cm2  (Fig. 
16)  while  the  temperature  rise  in  the  device  would  be  only  about  15  K  as  shown  in  Fig.  15. 

IV.  CONCLUSIONS 

We  have  proposed  a  new  cold  cathode  emitter  which  consists  of  a  thin  wide  bandgap  semi¬ 
conductor  material  sandwiched  between  a  metallic  material  or  heavily  doped  semiconductor, 
and  a  low  work  function  semimetallic  thin  film.  We  have  shown  that  the  capture  of  electrons 
by  thin  semimetallic  layers  grown  on  the  escape  surface  of  wide  bandgap  semiconductors  can 
lead  to  a  dynamical  shift  of  the  work  function  of  the  semimetallic  layers  together  with  an 
increase  of  the  cathode  emission  current.  While  varying  the  device  and  physical  parameters 
of  the  structure,  our  studies  suggest  that  any  mechanism  which  promotes  additional  charge 
deposit  in  the  well  enhances  the  dynamic  work  function  shift  phenomenon  thereby  increasing 
the  emitted  current.  Potential  material  candidates  were  proposed  for  cold  cathode  operation 
with  applied  bias  under  10  V,  with  current  densities  approaching  several  tens  of  A/cm2,  and 
with  large  power  efficiencies  ( tjp  approaching  15  %). 

The  results  of  our  analysis  show  that  a  cold  cathode  with  either  a  rectangular  or  circular 
emission  window  and  with  the  parameters  listed  in  Tables  I  and  II  would  emit  a  uniform 
current  density  of  about  15 A/cm2  at  a  dc  biasing  voltage  of  about  8V.  For  that  bias,  the 
effects  of  current  crowding  would  be  negligible  and  the  temperature  rise  in  the  active  area 
of  the  cathode  (CdS/LaS  layers)  as  a  result  of  self- heating  effects  would  be  negligible. 

Further  improvements  to  the  theory  should  include  a  more  realistic  model  for  the  emis¬ 
sion  current  and  transport  through  the  wide  bandgap  material.  Also,  the  semimetallic  film 
energy  density  of  states  (to  account  for  the  d-band  character  of  the  conduction  band  in  the 
chosen  rare-earth  semimetallic  samples  [14]),  the  finite  probability  for  electron  wavefunctions 
in  the  thin  semimetallic  films  to  extend  in  the  semiconductor  material  [22],  a  more  accurate 
description  of  the  energy  loss  mechanisms  [23]  and  screening  effects  (including  the  lateral 
ohmic  voltage  drop)  in  thin  semimetallic  layers  [24].  Finally,  our  study  of  self-heating  effects 
should  include  partial  cooling  of  the  cathode  due  to  heat  conduction  through  the  thick  LaS 
layers  which  was  neglected  in  this  study.  The  latter  would  allow  to  extend  slightly  the  dc 
biasing  operating  range  of  the  cathode  beyond  the  estimate  reported  here.  Furthermore,  the 
thermal  and  electrical  models  of  the  cathode  described  here  should  be  solved  self-consistently. 
Once  all  these  effects  are  taken  into  account,  we  believe  the  quantitative  operation  of  the 
cold  cathode  exposed  here  will  stay  essentially  correct  predicting  a  dynamical  shift  of  the 
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work  function  of  the  thin  semimetallic  film  of  the  same  order  of  magnitude  than  the  one 
reported  here. 

Our  analysis  provides  the  basic  design  rules  to  fabricate  a  new  cold  cathode  with 
emission  windows  with  a  rectanguler  or  circular  geometry.  The  growth  of  the  structure 
would  require  the  epitaxial  growth  of  the  structure  shown  in  Fig.l.  As  discussed  above,  the 
epitaxial  growth  of  InP/CdS  heterostructures  has  been  reported  in  the  literature  in  the 
past  [9].  The  deposition  of  epitaxial  LaS  thin  films  has  not  been  reported,  to  the  best  of 
our  knowledge.  We  believe,  however,  that  the  figures  of  merits  of  the  various  cold  cathodes 
analyzed  in  this  work  are  a  strong  incentive  towards  the  experimental  investigation  of  these 
devices.  If  successful,  such  an  experimental  effort  would  lead  to  big  pay-offs  with  the  design 
of  highly  efficient  cold  cathodes  for  large  panel  displays,  IR  image  convertors  and  sensors,  and 
active  power  devices  in  mobile  and  airborne  electronic  equipment  for  military,  commercial, 
and  private  use. 
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A  NOVEL  COMPATIBILITY/EQUILIBRIUM  BASED  ITERATIVE 
POST-PROCESSING  APPROACH  FOR  AXISYMMETRIC  BRITTLE 

MATRIX  COMPOSITES 

Reaz  A.  Chaudhuri,  Associate  Professor 
Department  of  Materials  Science  &  Engineering,  University  of  Utah 

Abstract 

A  semi-analytical  iterative  approach  for  enhancing  the  existing  two-dimensional  quasi- 
continuous  axisymmetric  stress  field  for  a  brittle  matrix  micro-composite  (i.  e.,  a  single  fiber 
surrounded  by  a  concentric  matrix  cylinder),  is  presented.  The  existing  solution  employs 
Reissner’s  variational  theorem  in  conjunction  with  an  equilibrium  stress  field  in  which  the  radial  (r- 
)  dependence  is  assumed  a  priori. 

In  the  present  approach,  the  stress  distribution  in  the  radial  direction  obtained  from  the 
afore-cited  variational  model  is  improved  a  posteriori  through  an  iterative  approach  that  involves 
successive  substitution  of  the  previously  computed  strains  (or  stresses)  into  the  equations  of 
compatibility  and  equilibrium.  The  boundary/interface  conditions  at  r  =  constant  surfaces  are 
satisfied  in  the  pointwise  sense.  However,  this  process  leaves  the  end  boundary  conditions  in  the 
axial  direction  (i.e.,  at  surfaces  z  =  constant)  undefined  thus  rendering  the  boundary-value 
formulation  ill-posed.  This  ill-posedness  is  removed  by  introducing  appropriate  boundary  error 
terms  that  help  satisfy  the  end  boundary  conditions  at  surfaces  z  =  constant.  As  a  first  step,  an 
approximate  plane  strain  version  of  the  present  solution  is  implemented  in  a  FORTRAN  code.  An 
illustrative  thermal  stress  problem  is  solved  and  used  to  compare  with  the  existing  variational 
solution . 
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A  NOVEL  COMPATIBILITY/EQUILIBRIUM  BASED  ITERATIVE 
POST-PROCESSING  APPROACH  FOR  AXISYMMETRIC  BRITTLE 

MATRIX  COMPOSITES 

Reaz  A.  Chaudhuri,  Associate  Professor 
Department  of  Materials  Science  &  Engineering,  University  of  Utah 

1.  Introduction: 

Studies  of  the  behavior  of  unidirectional  and  laminated  composites  made  from  stiff  elastic 
matrix  materials  which  may  develop  imperfect  interfaces  with  the  fibers  have  enjoyed  a  revival  after 
the  early  classical  work  of  Aveston,  Cooper,  and  Kelly  (1971).  Pagano  (1991)  refers  to  these 
materials  as  brittle  matrix  composites  (BMC).  The  aforementioned  ACK  modeling,  as  well  as  the 
more  recent  development  proposed  by  Budiansky,  Hutchinson,  and  Evans  (1986),  are  based  upon 
primitive  approximations  of  the  stress  field  developed  within  a  concentric  cylinder,  i.e.,  a  circular 
cylindrical  body  of  one  material  surrounded  by  a  concentric  annulus  or  ring  of  a  second  material. 
There  exists  a  considerable  body  of  literature  associated  with  the  elasticity  problem  of  a  concentric 
cylinder,  where  modem  interest  is  focused  on  its  use  as  a  representative  volume  element  (RVE)  of 
a  unidirectional  composite  (Hashin  and  Rosen,  1964;  Pagano  and  Tandon,  1988).  Additionally,  a 
significant  segment  of  the  composite  literature  is  based  upon  the  one-dimensional  shear  lag 
analysis,  which  was  apparently  originated  by  Cox  (1952).  The  results  obtained  using  this  kind  of 
analysis  are  too  inaccurate  to  merit  further  attention  in  this  report 

Sternberg  (1970)  solved  several  axisymmetric  load  diffusion  problems  within  a  concentric 
cylindrical  domain,  in  which  the  elasticity  formulations  are  simplified  by  the  assumption  that  the 
core  material  (fiber)  can  be  modeled  as  a  one-dimensional  bar  while  the  annular  region  (matrix)  in 
all  cases  extends  to  infinity.  An  exact  solution  was  derived  for  the  case  in  which  the  bar  was 
circular  in  cross-section,  and  was  fully  immersed  within  the  unbounded  matrix.  A  similar 
approach,  with  the  difference  of  the  fiber  being  assumed  rigid,  was  employed  by  Luk  and  Keer 
(1979).  This  assumption  is  more  appropriate  for  resin  matrix  composites,  where  the  fiber  to 
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matrix  modulus  ratio  is  very  high.  The  axisymmetric  elasticity  problem  of  a  broken  fiber  embedded 
in  an  infinite  matrix  was  treated  by  Pickett  and  Johnson  (1967).  In  that  work,  the  fiber  is 
represented  as  a  three-dimensional  elastic  medium;  however,  the  report  contained  no  numerical 
results  for  the  stress  field.  Smith  and  Spencer  (1970)  also  formulated  an  axisymmetric  elasticity 
solution  by  a  semi-inverse  method  for  a  class  of  boundary  value  problems  in  which  the  radius  and 
length  of  the  concentric  cylinder  are  both  finite.  The  solution  is  expressed  in  the  form  of  a  series  of 
non-orthogonal  functions  which  satisfy  the  field  equations  exactly.  Homogeneous  boundary 
conditions  on  the  radial  surface  of  the  body  are  satisfied  exactly  while  realistic  end  conditions  can 
be  approximated.  The  singularities  predicted  by  Zak  (1964)  are  smoothed  out  in  this  approach.  A 
very  formidable  study,  which  includes  correlation  with  experimental  observations,  is  that  by 
Atkinson  et  al  (1982).  In  this  work,  the  pullout  of  a  single  fiber  from  a  matrix  cylinder  is  treated. 
The  fiber  extends  only  partially  along  its  length  into  the  matrix.  Results  are  provided  for  a 
perfectly  bonded  fiber-matrix  interface  as  well  as  for  states  in  which  lateral  (curved)  surface 
debonding  or  fiber  end  plane  debonding  take  place.  The  problem  is  solved  by  "patching"  the 
asymptotic  singular  stress  field  to  that  given  by  finite  elements,  although  this  method  is  not  always 
reliable.  An  interesting  conclusion  is  reached  that,  at  least  in  a  qualitative  sense,  the  interface 
failure  response  can  be  anticipated  from  the  stress  field  within  the  uncracked  rod.  In  a  model 
similar  to  Pagano's  (1991),  McCartney  (1990)  treated  a  class  of  concentric  cylinder  problems  in 
which  matrix  cracking  or  debonding  with  or  without  friction  are  present.  In  that  work,  the 
functional  r-dependence  of  the  stress  components  is  assumed  which  leads  to  a  system  of  ordinary 
differential  equations  in  z.  All  appropriate  field  equations  are  satisfied  with  the  exception  of  two  of 
the  constitutive  relations  while  some  of  the  boundary/interface  conditions  could  only  be  satisfied  in 
an  average  sense.  The  ease  and  potential  effectiveness  of  this  model  for  composite  analysis  will 
demand  its  careful  consideration  in  comparison  with  exact  solutions  and  solutions  given  by 
competitive  approaches.  For  example,  Kurtz  and  Pagano  (1991)  formulated  an  infinite  series 
solution  of  the  axisymmetric  elasticity  problem  in  which  a  fiber  is  being  pulled  from  the  matrix. 
The  length  of  the  body  as  well  as  the  outer  radius  are  finite.  Although  the  singularity  is  not  explicit 
in  the  solution,  Cesaro  summation  has  been  employed  to  improve  the  convergence  of  the  stress 
field  within  the  singular  region. 
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The  solution  for  the  stated  problem  has  been  obtained  by  employing  a  modified  version  of 
the  variational  model  (Pagano,  1991)  of  an  axisymmetric  concentric  cylinder,  which  was 
successfully  implemented  earlier  in  the  case  of  a  flat  laminate  by  Pagano  (1978).  The  model  is 
generated  by  subdividing  the  body  into  regions  consisting  of  a  core  and  a  number  of  shells  of 
constant  thickness  and  length  and  satisfying  the  Reissner  variational  equation  (1950)  with  an 
assumed  stress  field  in  each  region.  The  number  of  regions,  in  particular  in  the  r  direction,  can  be 
increased  in  order  to  improve  solution  accuracy.  The  regions  are  selected  such  that  the 
thermoelastic  properties  are  constant  and  the  boundary  conditions  do  not  change  character  on  any 
of  the  bounding  surfaces  within  each  region.  Pagano  (1991)  has  sought  to  predict  the  influence  of 
various  kinds  of  damage  (see  Figure  1)  and  their  interactions  by  accurately  modeling  the  micro¬ 
mechanical  stress  field  in  their  presence  by  using  the  afore-cited  variational  approach. 

Strengths  and  Weaknesses  of  the  Existing  Approach  (Pagano,  1991) 

Strengths: 

1.  This  assumed  stress  based  approach  insures  satisfaction  of  the  axisymmetric  equilibrium 
equations; 

2.  The  existing  method  utilizes  a  reasonably  accurate  non-singular  axial  (z-direction)  variation  of 
the  computed  stress  field  that  also  satisfies  the  end  (z  =  const)  boundary  conditions  (i.  e.,  no 
artificial  discontinuity  due  to  sectioning). 

Weaknesses: 

1.  Layering  in  the  radial  direction,  that  introduces  artificial  discontinuities  in  some  stress 
components  at  a  layer  interface  within  the  same  material; 

2.  As  the  number  of  layer  increases,  the  computed  eigenvalues  become  numerically  very  large, 
thus  limiting  the  number  of  layers  required  for  accurate  stress  field  in  the  vicinity  of  a  stress 
singularity  point. 

Objectives  of  the  Current  Research: 

The  present  method  seeks  to  alleviate  the  afore-mentioned  weaknesses  of  the  existing  approach, 
while  preserving  its  inherent  strengths.  The  specific  goals  of  the  present  investigation  includes 
-  Improvement  of  the  radial  (r-)  variation  of  the  stresses  so  that  subdivision  into  very  thin  layers 
and  the  associated  blow-up  of  the  computed  eigenvalues  can  be  avoided, 
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-  Artificial  discontinuities  of  some  stress  components,  such  as  a0  at  a  layer  interface  within  the 
same  material  can  be  avoided. 

2.  Method  of  Analysis 

Starting  point  of  the  present  research  is  Pagano's  (1991)  layerwise  axisymmetric  solution 
for  fiber-matrix  concentric  cylinder  model,  based  on  Reissner's  variational  theorem  (1950)  in 
conjunction  with  an  equilibrium  stress  field,  in  which  the  radial  (r-)  dependence  is  assumed  a 
priori.  An  approximate  model  was  formulated  to  define  the  thermoelastic  response  of  a  concentric 
fiber-matrix  cylindrical  body  under  axisymmetric  boundary  conditions.  The  interfaces  between 
continguous  cylinders  may  be  either  continuous  or  subjected  to  mixed  traction  and  displacement 
boundary  conditions.  The  external  surfaces  may  be  subjected  to  mixed  boundary  conditions  that 
are  consistent  with  the  model  assumptions  but  otherwise  arbitrary. 

In  this  paper,  an  improved  stress  field  within  a  layer  is  derived  starting  from  Pagano's 
layerwise  axisymmetric  fiber-matrix  concentric  cylinder  solution.  The  stress  distribution  in  the 
radial  direction  obtained  from  the  afore-cited  layerwise  fiber-matrix  concentric  cylinder  model  is 
improved  a  posteriori  through  an  iterative  approach  that  involves  successive  substitution  of  the 
previously  computed  strains  (or  stresses)  into  the  equations  of  compatibility  and  equilibrium.  A 
similar  procedure  was  implemented  in  the  post-processing  part  of  a  layerwise  finite  element  code 
for  analysis  of  quasi-three-dimensional  laminated  plates/shells  to  obtain  a  more  accurate  through¬ 
thickness  distribution  of  interlaminar  shear  stresses  (see  Chaudhuri  and  Seide,  1987a;  and 
Chaudhuri,  1990).  The  boundary/interface  conditions  at  r  =  constant  surfaces  are  satisfied  in  the 
pointwise  sense,  thus  eliminating  artificial  discontinuity  in  computed  CTe(r,  z)  and  oz(r,z)  across  a 

layer-interface  within  the  same  material.  In  what  follows,  the  axisymmetric  fiber-matrix  concentric 
cylinder  is  assumed,  for  the  purpose  of  illustrating  the  potential  of  the  new  method,  to  be  divided 
into  two  layers  —  the  core  or  fiber  layer  and  the  annulus  or  matrix  layer.  First,  the  method  of 
deriving  the  improved  stress  field  is  illustrated  for  the  fiber  layer,  which  is  then  followed  by  the 
same  for  the  matrix  layer. 
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Improved  Stress  Field  in  the  Core  (Fiber  Laver) 

Pagano  (1991)  in  his  axisymmetric  fiber-matrix  concentric  cylinder  model  has  assumed  a 
linear  variation  of  axial  stress,  gz(t,z)  and  hoop  stress,  Gq(t,z),  with  respect  to  r  within  a  fiber 

or  matrix  layer.  This  is  consistent  with  the  assumption  of  Love-Kirchhoffs  thin  shell  theory. 
However,  he  derived  the  interlaminar  shear  stress,  xra(r,  z),  and  the  radial  stress,  crr(r,z),  using 
the  equations  of  equilibrium  in  line  with  his  earlier  work  (see  Pagano,  1969)  before  substituting 
these  stresses  into  Reissner's  (1950)  variational  principle.  Pagano  (1969)  had  shown  that  these 
stresses  are  improved  to  the  extent  that  they  are  reasonably  close  to  their  elasticity  theory  based 
counterparts.  Chaudhuri  (1986),  and  Chaudhuri  and  Seide  (1987a)  successfully  implemented  this 
method  in  a  finite  element  based  post-processing  approach  for  computing  interlaminar  shear  stress 
distribution  through  the  thickness  of  a  symmetric  laminate.  Pagano's  (1991)  stress  field  for  the 
fiber  layer  (core)  is  as  shown  below: 

<*z(r,z)  =  Pn(z)— — -  +  Pi2(z)“~'  (la) 

r2  r2 

^rz(r»z)  =  P52(z)—  +  p53(z)(r-r2)r  (lb) 

r2 

arf(r,z)  =  pliteX— 4+p52(z)-  +  p33(z)(r2  -r22)r  +  p34(z)(r-r2)r  (lc) 

r2  r2 

A  bar  on  the  stresses  indicates  that  these  are  computed  using  the  axisymmetric  variational  model 
due  to  Pagano  (1991).  The  general  form  of  Pagano's  (1991)  solution  for  any  of  the  dependent 
variables  P(z)  is  expressed  by 

P(z)  =2Ai^z  +  Pp(z)  (2) 

i 

within  each  constituent  where  Ai  are  constants,  Xi  are  eigenvalues  of  a  determinant,  and  Pp(z)  is  a 
particular  solution,  which  in  the  present  case  is  a  simple  polynomial.  Further  details  of  the  solution 
procedure  including  the  method  for  determining  the  higher  order  eigenvector  and  higher  order 
particular  solution  are  discussed  by  Brown  (1992). 

It  is  worthwhile  to  point  out  here  that  although  the  barred  stresses  satisfy  the  equations  of 
equilibrium  for  an  axisymmetric  elastic  body,  the  corresponding  strains  fail  to  satisfy  the  equations 
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of  compatibility  in  the  pointwise  sense.  It  is  noteworthy  that  although  both  the  plane  strain  and 
axisymmetric  deformations  represent  two-dimensional  states,  the  latter  case  requires  four 
compatibility  equations  to  be  satisfied  by  the  strains  computed  using  Pagano’s  (1991) 
axisymmetric  variational  model.  This  is  in  contrast  to  the  case  of  plane  strain,  wherein  only  one 
compatibility  equation  out  of  the  six  is  not  an  identity.  However,  if  the  hoop  stress,  ct0,  is  derived 

using  the  exact  axisymmetric  elasticity  based  kinematic  relations  and  Hooke’s  law,  then  three  of  the 
four  compatibility  equations  required  by  the  axisymmetric  elasticity  theory  become  identities.  The 
combined  kinematic  and  stress-strain  relations  for  axisymmetric  elasticity  theory  are  given  as 
follows  (Timoshenko  and  Goodier,  1959): 


u(r,z)  =  re9(r,z)  =  ^{ae  -  v(ar  +  cz)} 

E 

9u(r,z)  .  .  1  ,  _ .. 

^  =  er(r>z)  =  -{or  -v(oe  +  az)} 

Elimination  of  u(r,z)  from  eqns  (3),  followed  by  integration  with  respect  to  r,  yields 


(3a) 

(3b) 


a0(r,z)  =  r~(1+v) 


(l  +  v)rvCTr(r,z)  +  vrl 


(i+v)  dar  (r,  z)  ^  ^_(i+v)  d<Jz  (r,  z) 


dr 


dr 


dr  +  F0(z) 


(4) 

where  Fq(z)  is  an  arbitrary  function  of  z.  Substitution  of  a,  (r,z)  and  az(r,z)  from  eqns  (la,b) 
into  the  right  side  of  eqn  (4)  yields 


-f  f  1  _ 

ae(r,z)  =  a0  *  (r,z)  +  -j—  D0(z) 

r  f 

where 


(5a) 


cfe  * (r, z)  =  (1  +  vf XP3 1 (z)( — ~r - +  P32 (z)  7  r  —  +  P33 (z)(— ^  — ^-7) 

Vf+1  r2(Vf  +  2)  T2(Vf-t-2)  Vf  +4  Vf  +2 

2  3 

+p54(z)(^-t - ~r)]  +  Vf E-pSi(z)- — +  P32 (z)  7  r  —  +  P33(z)(-— — 

Vf  +  3  Vf  +  2  (Vf+2)r2  r2(vf+2)  vf  +  4 


2r2 


— ^TT)  +  P34(zX  . 

vf  +2  vf  +3  vf  +2 


— ) "  Pn(z>  -  ^  .  +  P12 (z)  —~t\ 

r2(vf  +  2)  vf  +2 


(5b) 
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The  corresponding  strains,  er  and  yrz,  can  now  be  obtained  by  substitution  of  the  above  stresses 
into  Hooke's  law: 


er  ^[Sr-vC^e  +  Sz)]; 
E 


Yrz  —  G  ^rz 


(6a, b) 


It  may  be  noted  that  the  above  two  strains,  er  and  yK,  are  not  compatible  with  the  axial  strain,  £z, 
computed  using  Hooke's  law 

e2  =  ttC^z  “  v(CTe  -t-^r)]  (6c) 

E 


because  the  remaining  compatibility  equation 


£gz  £gr  _  £Yrz 

5r2  5z2  9r3z 


is  not  satisfied.  The  axial  strain  is,  therefore,  obtained  by  substituting  £r  and  yra  given  by  eqns 
(5a,b)  into  the  integrated  (twice  with  respect  to  r)  version  of  the  compatibility  equation  (7) 


£z  (r,  z)  =  -  JJ  ^-%dsdr  +  J  +  rFi  (z) +  F2  (z) 


which  yields 

£z(r,z)  =  £z  *  (r,z)  -  ^f-D'0(z)(log  r  - 1)  +  -^-{rD^z)  +  D2(z)}  1 

Ef  Gf 

where 

* (r>z)  =  + p“(z)^ + p*(z)7(75  "  J>+ P«(Z)T<§  - r*)] 

-Vf(l  +  V()[p31(Z)(^-i^^}  +  pM(z)^^  +  P;,(z)fi5^ 


_2_3  _4  r  _3 

- - - }  +  P34(z)( - - - 

6(vf+2  Y  F34WV12(vf+3)  6(vf+2) 


3  3  3 

— - )]  -  V?[-P3i  (z) - - - 1-  P32(z) - 

vf  +2)  fL  F3iw6(vf  +  2)r2  F32w6(vf+2)r2 
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3f5  2  3  43  3 

+P33 <Z)<20(vf  +  4)  “ 6(vf+2))  +  P34(Z)(6(3  +  vf) ” 6(2+vf)) “ Pu<Z)6r2(2  +  vf) 


+p‘2(z)557^)] + d“(z)7) + i(p“(z)i" + P53(Z)(H  " r2  J)] 


It  may  be  noted  that  the  superscript  f  on  p^  is  dropped,  in  the  interest  of  notational 
convenience,  in  eqn  (9b)  and  in  those  that  follow.  The  corresponding  normal  stress,  az ,  can  be 
obtained  by  using  Hooke's  law  as  follows: 


(r>z)  =  Ef  (r,  z)  +  vf  (aj  (r,  z)  +  ( r,  z)} 


It  is  noteworthy  that  although  the  axial  strain  or  its  stress  counterpart,  dz,  satisfies  the 

compatibility  eqn  (7),  it  is  no  longer  in  equilibrium  (in  the  pointwise  sense)  with  the  stresses, 
T^and  6r.  These  stresses  are,  therefore,  rederived  from  the  following  two  equilibrium  equations: 


dx„  do7  _ 

dr  r  dz 


(Ha) 


daT  ,  <3r  —  CTq  |  d^rz  _  Q 
dr  r  dz 


(lib) 


xn  and  dr  can  now  be  obtained  from  eqns  (11)  upon  integration  as  follows: 


^iz(riz)  =  ~[~Jr~ +  F3(z)] 
r  J  dz 


(12a) 


ar(r,z)  =  i[Jaedr- Jr^Sdr  +  F4(z)] 


(12b) 


c 

Substitution  of  dz(r,z),  given  by  eqn  (10)  into  the  integrated  equilibrium  equation  (12a)  yields 


T^(r,  z)  -  ■£  *  (r,z) + Dj  3— H  +  D0  (z)  1  +  -£-  {D;  (z)  +  D2(z)}  +  £i-(z) 

2-vf  (l_Vf)rvf  2Gf  r 


(13a) 


where 
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i «  *  (f’  Z>  =  P31  (zX15rf^_4r  >  +  P32 (Z>^T  +  P33 (ZX^  -  ^5")  +  P34(Z)(^  “  ^ 


72  30 


3  ,4  r4  r6 

”EfVf  (1  + Vf)[p3I(z){^— P32(z)^^jJ^7+  P33(zM'J^^j7^j 

~ 30 (vf  +  2) 1  +  P34 (Z),72(vf  +  3)  ”  30(vf  +  2) 11  E,V,[  P3I<2)30(vf +2)r2 

4  ,24  _5  4 

+p32<2) 30(vf~)'rl +  P33<Z),IS(^)  _  +  P34  Z  36(v+3)  “ ^2? 

'p:,(z)w^+pl"2(z,5o^]+t[P32<z)3k+p” 


4  ,2,2 


.3  ,2, 


+Vf[P3l(zX3r\o'2r  ■  )  +  P32(zX~)  +  P33(z>(y  -^J“)  +  P34 (zX“j 


4  3 


+Vf (1  +  Vf )[P3i (Z){^^ - +  P32 (Z> +  P33 (Z){^4 “  \^+2 } 


+P34(zX4(Vf  +3)  3(Vf  +2)^  +  Vf [  P3l(Z)3(vf  +2)r2  +  P32(Z)3(vf  +2)r2 
+P33  (Z){5(vf+4) '  3(vf  +3)}  +  P34  (Z)(2(vf  +  3»  ~  3(vf  +  2)}  ”  Pl  1  (Z>  3(vf+2)r2 


+p12(z) - ] 

F12  3(vf+2) 


(13b) 


Substitution  of  de(r,z)and  T^frz)  given  by  eqns  (5)  and  (13)  into  the  second  integrated 
equilibrium  equation  (12b)  yields 


df(r,z)  =  aj  *(r,z)-(l+vf)D0(z)vfr1  Vf  +p0(z)^— — 

+D»  <z>,,  --'TT - + + D*<z»  *  D »  <z> + 

(2- vf)(3- vf)  6Gf  r 


(14a) 
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where 


orf  *  (r,  z)  =  (1  +  vf  )[p3  i  (z){  1  -  ■■■"■  }  +  p32  (z)  — — 

(vf  + 1)  2(vf  +  2)r2  2(vf  +  2)r2 

3  2  2 

+p33(z){ — - ^ — }  +  p34(z){ - - - M - }]  +  vf[-p31(z) - - - 

F33Wl4(vf+4)  2(vf+2)  34  3(vf+3)  2(vf+2)  f  F31  2(vf+2)r2 


+P32(Z) 


2(vf  +  2)r2 


+  P33(Z){ 


4(vf  +  4)  2  2(vf+2) 


}  +  P34(z){ 


r  r 
- } 


3(vf  +  3)  2(vf  +  2) 


Pll(Z)2(vf  +  2)r2  +Pl2(Z)2(vf  +2)]  [Psi(z)(  40  180}r2  +P32(z)180r2 


r7  r2r5  ....  r6  r  r5  ..  r4  r5 

+p33(z)( - 2_)  +  p  (Z)( - 2— )  -  Efvf  (1  +  Vf )[p31  (z){ - - - - - } 

F33  1120  180  F34  504  180  ff  f  F31  40(vf+l)  180(vf+2)r2 


_5  _7  2  5  6 

♦»«  r  •**»  i  r  mi  r 

+p32(z) - +  p33(z){ - - - }  +  p34{ - 

F32  180(vf  +  2)r2  F33  960(vf+4)  180(vf  +  2)  F34  504(vf+3) 


- ^ - }]  -  Efv2  [-p31(z) - - - +  p32(z) - - - 

180(vf  +  2)  f  f  F31  180(vf+2)r2  F32  180(vf+2)r2 


or7  r2r5  „  _6  r  5 

+p33  (z){ - - - }  +  p34  (z){ - - - } 

1120(vf  +4)  180(vf  +  2)  F34  2  52(vf+3)  180(vf+2) 


r5  i  Ef  r  ....  .  .  r5  ....  .  .  .  r6  r2r5, 

_ 1  J _ _ I  (  r*\( _ 4 _ 


— Pll(z) - V  Pl2(z) - ]  H - HPssCz) - Y  pc*(z)( - - - )] 

F11W180(vf  +  2)r2  F12W180(vf+2)r2J  Gf  52W180r2  P53V  A504  180;J 


+vf  {p3iUX“r-  -  tt)  +  P32(z)t^—  +  P33(z)(rr  "  +  Pm(z)(^  “ 


6  12 


30  12 


20  12 


— - }  +  p34(z){ - - - - }]  +  v2  [— p31(z) - — - 

4(vf  +  2)  20(vf+3)  12(vf  +  2)  12(vf+2)r2 


,  -  ,  ^  r3  "  ,  3r5  r2r3  ,  r4 

+p  (z) - h  p33(z){ - - - }  +  p34(z){ - 

F32W12(vf  +  2)r2  F33Wl30(vf  +  4)  12(vf+3)'  F34  A10(vf  +  3) 
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(14b) 


-  r3  r3 

— - }  -  pn(z) - 


12(vf  +2) 


- +  p12(z) - ] 

12(vf  +  2)r2  F12  12(vf  +  2) 


-leld  in 


The  procedure  illustrated  above  for  the  fiber  layer  can  easily  be  repeated  for  the  matrix 
layer.  Pagano's  stress  field  for  the  matrix  layer  (annular  region)  is  as  shown  below: 


Ozm)  =  Pn(4  7-7  +Pi2(4  7-7 
Vr2-r17  vr2-rl 


(15a) 


^  r(r2-r)  .  _  r(r“ri)  ,  _  ,J ft +  r2)r2 -(r2 +  iir2 +  r22)r  1 

*rz  =  P5l(z)  ,  - r  +  P52(z)— - -7  +  P53(z)  - T2 - +  7 


rl(r2  _rl) 


r2(r2-ri) 


(15b) 


oJm).=  P3l(z)f-^— ^  V  P32(z{-?— V  P33(z)(r3  -  (rl2  +  hh  +  r2  )r  +  rlr2(rl  +  r2>) 

Vr2-rlJ  Vr2_rlJ 

+p34(z)(r2  -  (ij  +  r2)r  +  r^)  +  p35(z)(r2  -  (rt  +  r2)r  +  r^)— (J5c) 

Following  the  procedure  illustrated  above,  the  improved  hoop  stress,  c“(r,z),  axial  strain, 
e“(r,z),  interlaminar  shear  stress,  t™(r,z),  and  the  radial  stress,  a“(r,z),  are  obtained  as 


follows: 


dg'(r,z)  =  <J?1»(r,2)  +  ^ 


(16a) 


*  *(r,z)  = 

r3  r  1  r2 

+P33(z){— — T  ~  (h2  +  rir2  +  ~ ~hh(h  +  r2)— 7 7}  +  P34  (2X77-7 


-(r,  +  r2) — - —  +  Tl*2  }  +  p34(z)— { — - (rt  +  r2) — - —  +  — —  rtr2}] 

1  2vm  +  2  vm  + 1  P34Wrir2lvm  +  2  2\m+l  rvm 


+vm[-p3i (z)  ^7  Y) +  P32(Z)  (r2  -  ii)(vm  +  2) +  P33(Z){  (^74) 


^  +  +r2)^}++P34(Z){^-(ri  +  r2)(^)} 
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+P35(Z){ — -r — -  — — }  -  Pn(z)- - - - —  +  Pn(zk - - - — ] 

rlr2(Vm+1)  (r2~rl)(vm+2)  ^2  “  rl  Xvm  +  2) 


(16b) 


The  axial  strain  is  now  obtained  by  substituting  into  the  compatibility  equation: 

ef  (r,z)  =  e“  *(r,z)-^L-Co(z)(logr-l)+-^-{rC1(z)  +  C2(z)} 
where 


(17a) 


e?  *  (r,z)  =  -^-[P3i(z)^ — M  +  P32(z)(r-'  3r-;''  +  P33(z){^r 


6(r2  -  q) 


6(r2-q) 


r3  r2  r4  r3  r2 

-(r2  +  qr2  +  r22)—  +  qr2(q  +  r2)y}  +  P34(z){—  -  (q  +  r2)—  +  qr2  —} 


6qr2  2qr2 


(r2  ~q)  2(vm  +  1) 


- - }  +  P32(z) - - - { - - - — - }  +  P33  (z){ - - - 

6(vm  +  2);  F32W(r2-q)l6(vm  +  2)  2(vm+l)'  F33Wl20(vm +  4) 


_fr2  +  rr  2^  r  (q  +  r2)qr2r  ■■  r  (q  +  r2)r3 

1  12  6(vm  +  2)  2(vm  + 1)  P34(  ){12(vm+3)  (vm  +  2)6 


2(vm  +  l) 


6(vm  +  2)  2(vm+l) 


(q+r2)r  qr2rlog(r-l) 

s _  .  i\  U 


Vm 

-^-C-P31(Z) 


6(r2-q)(vm+2) 


+  p32(z) 


6(vm+2)(r2-q) 


+  P33(Z){ 


20(vm  +  4) 


-(r2  +  qr2  +  r|) - - - }  +  P34(z){ - - - (ri  +  r2)r  j  +  p"  (z){ - £ - 

6(vm  +  2)'  °(vm+3)  6(vm  +2)  P35Wl2qr2(vm +1) 


..  r3  ..  r3 

-(log  r-l)}-Pll(z)  — - - - —  +  P12  (z)  — - - - — ] 

6(r2  -  q)(vm  +  2)  6(r2  -  q)(vm  +  2) 


6(r2-q) 


6(r2-q)J  Gc 


6q(r2-q) 
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+Ps^)f^  +  +  ^  ^  +  &-  +  WU 

^h(T2~  ri)  6rt  r2 


(17b) 


d”(r,z)  =  Emef  (r,z)  + vm{  crm  (r,  z)  +  6“  (r,  z)} 


x“(r,z)  =  x®  *  (r,z)  +  Cq  7 - }  +  C0(z) 


(l- vm)rVm  2G 


+  77; — {Ci(z)  +  C2(z)}  +  (19a) 


where 


s  r  ...  ,  N(15r2r3-4r4)  ...  ,  ,  (4r4 -15rir3)  .  ...  ,  xr  r6 

xg.(r.Z)  =  -[p3,(z)  12^_fi)  +  te«  120(r2:-^  +  P33^ 

r4  r3  ...  r5  r4  , 1 

-(rf  +  ^2  +  r22)— +rlr2(r1  +  r2)y}  +  pi(zX— -(rl  +  r2)~+r1r2r3-} 

~30(vm+2)}  +  P32(Z)  Vrh{30(vm  +2) "  8(vm  +1)}  +  P33(Z){120(vm  +4) 

+(r2  +  rr  ,  r2-i  r4  ,  hhih  +  r2)r3  ■»  (  r5  (*i  +  r2)r4 

(l  1  2  2  )30(vm  +  2)  8(vm  + 1)  P34  72(vm  +3)  30(vm+2) 


+_¥2£L_}  +  p3  (z){ - E! - Mf2^+^£l(logr_1/3)}] 

8(vm  + 1);  P35V  A30(vm  +  2)  8(vm  + 1)  3vm 


-[— P3l(z) 


30(r2  -  r1)(vm  +  2) 


+  P32(Z) 


30(vm  +  2)(r2-r1) 


+  P33(Z){ 


40(vm  +  4) 


(r12  +  r1r2  +  r22)r4  ■■■  r5  (ri  +  r2)r4  -  r3 

30(v_+2)  P34(  (36(vm+3)  30(vm  +  2)  P35<  )f8r1r2(vn  +  l) 


4  4 

.£1281+2!) .  p- (z) - £ - +P-2(Z) - £ - ] 

2  4*  Pm  •,30(r2-rl)(vn+2)  P12W30(r2 -r,)(vm +  2)J 


-^tp- (z)  (15r3f3  t.4£4). + p- (z)  <4r<  -  ‘V)) + -L[p;  (z)  (15r^  ~  4r<) 

EmlPllk  120(r2  -  rt)  Pl2'  120(r2-ri)  GmlP51'  HOqfe-r,) 
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.--.I 


■V"[pilW  Zy'i  )  *  ^-'^u-Z'  *  r»Mj  -  ™  *  tij 

3  2  2 

+rlr2(Il  +  l^}  +  P34(z)£-  -  (rl  +  r2)~r  +  rlr2  |}  +  P35(Z){^T  ~ (h  +  r2)^  +  ^}] 
^  4  3  2  3  2  r 


-Vm(l  +  Vm)[p31(z) - - - { - ^ - - - }  +  P;,(z) - - - { - - - 

(r2 ~1i)i2(l  +  vm)  3(vm+2)'  P32W(r2-r1)t3(vm  +  2) 


-  tTTTTTT  ^ +  P33  (z)^  c77T~To\  ~  ^  +  rtr2  +  r2 )  + } 


2(vm  +  1) 


5(vm+2) 


3(vm  +2)  2(vm  + 1) 


,  j-3  j.2  _  I  2 

+p34(z){ - --(r,  +r2) - +  r,r2 - }+  pU(z) - { - 

vb+3  2  3(v„+2)  l!2(v.tl)'  P35Wr,r2(3(vm+2) 


(r,+r2)r  r,r2,,  2r  •  ,  ,  r2 

.  +  }]  _  vm2[— p31  (z)  ■  •_  +  p32(z): 


2(v  + 1)  v. 


3(vm  +  2)(r2-ri)  ,3(vm  +  2)(r2-r1) 


+p33(z){ — — - (r2  +  rxr2  +  r2 ) - - - }  +  p34(z){ - - - (ri+r2)r  1 

5(vm  +  4)  12  2  3(vm  +2;  P34i;t2(vm+3)  3(vm  +  2)1 

+P35(Z){2(vm+l)r1r2  vm}  Pll(Z) 3(vm  +  2)(r2  -  +  Pl2 (Z) 3(vm  +  2)(r2  - ri)] 


-.1— vm 

drm(r,z)  =  o”  *  (r,z)  -  (1  +  vm)C0(z)vmr1-v">  +  Q(z)- - — - - 

(1- vm)(2- vm) 


(19b) 


2 _ v 

+C°(Z)f2  v  V3  v  I1  +  ^{C‘(Z)  +  C*(z)> - CJ <z>  +  — 
(i-vmX3-vm)  6Gm  r 


The  expression  for  g“  *  (r,z)  is  too  long  to  be  incorporated  in  this  brief  report 
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The  above  procedure  introduces  10  "constants"  of  integration  —  5  per  layer —  which  are 
functions  of  z.  These  are  determined  by  using  appropriate  boundaiy/interface  conditions  including 
those  at  the  axis  of  symmetry.  In  the  present  two-layer  fiber-matrix  concentric  cylinder  model,  the 
continuity  of  the  radial  and  shear  stresses,  and  also  of  the  radial  and  axial  displacement 
components,  is  enforced  at  the  fiber-matrix  interface.  Additionally,  the  radial  displacement 
component  and  the  shear  stress  vanish  at  the  axis  of  symmetry.  On  the  outer  free  surface,  the  radial 
and  shear  stresses  vanish.  The  details  are  available  in  Chaudhuri  et  al.  (to  be  published),  and  are 
omitted  here  because  of  space  limitation. 

Hl-posedness  and  Its  Remediation  Through  Regularization 

The  above  procedure  loses  the  end  boundary  conditions  in  the  axial  direction  (i.e.,  at 
surfaces  z  =  constant) ,  a  kind  of  mathematical  Alzheimer's,  thus  rendering  the  boundary-value 
formulation  ill-posed.  This  is  due  to  the  fact  that  the  boundary-value  problem  has  now  been 
transformed  into  an  initial  value  problem,  which  is  analogous  to  Hadamard’s  treatment  of  the 
Cauchy  problem  (see  Tikhonov  and  Arsenin,  1979).  This  ill-posedness  is  removed  by  introducing 
appropriate  boundary  constraint  terms  that  help  satisfy  the  end  boundary  conditions  at  surfaces  z  = 
constant.  The  details  are  available  in  Chaudhuri  et  al.  (to  be  published),  and  are  omitted  here 
because  of  space  limitation. 

3.  Example  Problem  —  Preliminary  Results 

As  an  illustration  of  the  present  approach  and  to  examine  the  fidelity  of  its  predictive 
capability,  we  consider  the  body  depicted  in  Figure  2  and  compare  our  result  to  the  respective 
Pagano’s  (1991)  variational  model  solution.  We  assume  that  all  the  boundaries  are  traction-free 
and  that  the  body  is  subjected  to  a  1°C  temperature  rise.  The  fiber  is  represented  as  a  single  solid 
cylinder  or  the  core,  while  the  matrix  is  represented  as  a  single  annular  ring  (Pagano's  N=l). 

Both  materials  are  assumed  to  be  isotropic  with  the  following  properties 

Ef  =  413GPa  vf  =  0.2  ccf=  3.25  p/C 

Em  =  63GPa  vm  =  0.2  am  =  3.50|i/C 

where  the  subscripts  f  and  m  stand  for  fiber  and  matrix,  respectively,  and  the  geometric  parameters 
are  taken  as 
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As  a  first  step,  the  present  solution  is  further  simplified  by  assuming  two-dimensional 
"plane  strain"  condition,  and  dropping  1/r  terms  in  the  equilibrium  equations.  Additionally,  the 
constitutive  relations  used  are  those  due  to  plane  strain.  Although  the  preliminary  results  thus 
obtained  demonstrate  that  the  present  approach  can  be  implemented  in  the  existing  FORTRAN 
code,  the  accuracy  of  the  results  are  far  from  encouraging.  We  are  currently  in  the  process  of 
implementing  the  exact  axisymmetric  analysis  in  the  FORTRAN  code.  The  results  will  be  reported 
in  Chaudhuri  et  al.  (to  be  published. 

4.  Summary  and  Conclusions 

A  semi-analytical  iterative  approach  for  enhancing  the  existing  two-dimensional  quasi- 
continuous  axisymmetric  stress  field  for  a  brittle  matrix  micro-composite  (i.  e.,  a  single  fiber 
surrounded  by  a  concentric  matrix  cylinder),  is  presented.  In  the  present  approach,  the  stress 
distribution  in  the  radial  direction  obtained  from  the  afore-cited  variational  model  is  improved  a 
posteriori  through  an  iterative  approach  that  involves  successive  substitution  of  the  previously 
computed  strains  (or  stresses)  into  the  equations  of  compatibility  and  equilibrium.  The 
boundary/interface  conditions  at  r  =  constant  surfaces  are  satisfied  in  the  pointwise  sense. 
However,  this  process  leaves  the  end  boundary  conditions  in  the  axial  direction  (i.e.,  at  surfaces  z 
=  constant)  undefined  thus  rendering  the  boundary- value  formulation  ill-posed.  This  ill-posedness 
is  removed  by  introducing  appropriate  boundary  error  terms  that  help  satisfy  the  end  boundary 
conditions  at  surfaces  z  =  constant.  As  a  first  step,  an  approximate  plane  strain  version  of  the 
present  solution  is  implemented  in  a  FORTRAN  code.  An  illustrative  thermal  stress  problem  is 
solved  using  the  plane  strain  version  and  used  to  compare  with  the  existing  variational  solution. 

We  are  currently  in  the  process  of  implementing  the  exact  axisymmetric  analysis  in  the 
existing  software.  The  results  will  be  reported  in  Chaudhuri  et  al.  (to  be  published. 

The  ongoing/future  work  is  concerned  with  the  determination  of  the  "stress  intensity 
factor"  by  way  of  matching  the  local  asymptotic  stress  field  due  to  Zak  (1964)  with  the  improved 
solution  at  an  arbitrarily  close  distance  from  the  point  of  stress  singularity  (i.  e.,  fiber-matrix 


interface  located  at  a  free  edge).  The  model  is  designed  to  analyze  experiments  such  as  pullout 
tests,  and  also  to  represent  the  concentric  cylinder  model  of  a  composite  representative  volume 
element  and  it  contains  the  capability  to  enhance  the  accuracy  of  a  given  numerical  solution.  When 
completed,  this  research  will  represent  a  novel  semi-analytical  approach  to  improve  solution 
accuracy  and  numerical  efficiency  of  existing  variational  solutions  for  micro-composites. 
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NEW  TECHNIQUES  FOR 
NON-COOPERATIVE  TARGET  IDENTIFICATION 

Julian  Cheung 
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ABSTRACT 

A  new  methodology  for  identifying  aircraft  using  radar  range  profiles  through  application  of  the 
generalized  likelihood  ratio  test  is  developed.  The  new  identifier  maximizes  the  probability  of  target 
identification,  at  a  prespecified  Type  I  error,  in  dependent  Gaussian  noise,  and  it  identifies  the  target 
without  the  compromise  of  making  forced  decisions.  When  subjected  to  more  restrictive  conditions, 
it  reduces  to  other  classes  of  identifiers,  including  the  maximal  cross-correlation  identifier  and,  after 
power  transformation,  the  shortest  Euclidean  distance  identifier.  All  system  parameters  necessary 
for  implementation  are  estimated,  and  the  procedure  does  not  require  any  prior  information  about  the 
statistical  properties  of  the  measured  data.  The  results  of  experiments  with  an  extensive  real  high 
range  resolution  (HRR)  data  set  demonstrate  that  the  proposed  identifier  attains  reasonable 
probability  of  detection  and  is  amenable  to  real  time  processing.  It  also  suggests  the  feasibility  of 
target  identification  using  synthetic  signatures,  and  the  implementation,  except  for  minor 
modifications,  parallels  its  counterpart  using  measured  signatures. 
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NEW  TECHNIQUES  FOR 
NON-COOPERATIVE  TARGET  IDENTIFICATION 

Julian  Cheung 

I.  INTRODUCTION 

The  application  of  radar  to  target  detection  has  been  an  important  problems  almost  since  its  invention. 
The  crucial  issue  that  has  been  overlooked  is  to  identify  what  the  radar  has  detected,  without  which 
proper  action  cannot  be  initiated.  This  urgency  is  evident  by  the  downing  of  two  US  helicopters  by 
two  US  F15’s  over  the  Kurdish  region  in  Iraq.  In  its  own  right,  target  identification  is  an  important 
topic  in  pattern  recognition.  Motivated  by  these  needs,  there  is  a  tremendous  effort  to  advance  the 
problem  of  target  identification  using  radar.  A  promising  approach  is  to  use  the  range  profile,  which 
is  the  1-dimensional  (D)  scattering  distribution  of  a  target  along  the  meradial  distance,  for  data 
representation.  Besides  providing  information  pertinent  to  the  location  and  scattering  strength  of  the 
target’s  scattering  centers  at  a  particular  aspect,  the  shape  of  the  range  profile  is  essentially  preserved 
for  minor  changes  in  aspect  angles.  This  should  be  compared  to  the  target’s  radar  cross-section 
which,  due  to  its  dependency  on  frequency  and  aspect  angles,  is  preventing  it  from  use  in  direct 
applications. 

Most  of  these  algorithms  are  designed  based  on  maximizing  the  correlation  of  the  testing 
profile  with  the  target  signatures  in  the  data  base  [1-4],  The  pre-stored  target  signature  correlating 
most  strongly  with  the  testing  profile  is  declared  to  belong  in  the  latter’s  target  class.  The  maximal 
correlation  technique  is  simple,  fast  and,  after  normalization,  it  is  not  susceptible  to  the  variability  of 
the  underlying  profile’s  power  resulting  from  the  ever-changing  radial  distance  between  the  target  and 
the  radar.  However,  it  does  not  maximize  the  probability  of  target  identification  and,  therefore,  it  is 
a  suboptimal  identifier.  Neither  does  the  procedure  accommodate  the  intra-bin  dependence  of  noise 
processes  due  to  the  high  sampling  rate  afforded  by  modem  radar.  At  best,  all  these  algorithms  only 
make  forced  decisions:  they  decide  the  testing  profile  to  be  a  target  class  trained  in  the  library  set, 
even  if  the  underlying  profile  belongs  to  a  target  class  not  in  the  library  set.  These  difficulties  are 
addressed  in  this  paper,  by  developing  a  new  methodology  for  identifying  an  aircraft  using  radar  range 
profiles  and  an  algorithm  for  computing  them.  A  detailed  discussion  of  this  methodology  can  be 
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found  in  [5],  For  purpose  of  brevity,  only  the  basic  concepts  are  presented  in  this  technical  report, 
which  is  organized  as  follows. 

Section  II  is  devoted  to  the  derivation  of  optimal  target  identifiers  based  on  the  generalized 
likelihood  test  with  unknown  target  signatures  and  dependent  Gaussian  noise  processes.  Section  III 
considers  the  problem  of  estimation  of  system  parameters.  Section  IV  concentrates  on  some  practical 
problems,  including  the  variability  of  view  of  aspect,  power,  and  signature  location  that  plague  a 
parametric  system,  by  suggesting  appropriate  remedies  to  circumvent  these  difficulties.  The 
compensation  of  synthetic  target  signatures  in  place  of  corresponding  measured  signatures  is  also 
considered.  In  Section  V  identification  experiments  using  real  radar  range  profiles  are  undertaken 
to  demonstrate  the  efficacy  of  the  new  target  identifiers.  Further  discussions  and  suggestions  are 
presented  in  Section  VI. 


H.  OPTIMAL  TARGET  IDENTIFICATION 

In  this  section  the  target  identification  problem  for  binary  classes  is  formulated.  The  resulting 
identifier  requires  only  the  noise  covariance  matrix  and  the  average  target-signature  vectors.  This  is 
generalized  to  theM-ary  identification  problem. 

(a)  Binary  Classifier:  Consider  the  radar  scatterer  returns  a:=[Xj  -  x^'  from  5  range  bins.  According 
to  [6-7]  the  return  from  bin  b,  xb,  is  a  complex  scalar  that  entails  in-phase  and  quadrature 
components  x^,  xb.j,  respectively,  b=l,  -,B. 

When  target  class  (Oj  is  present,  i.e.,  hypothesis  Hl ,  each  real  part  of  xb  is  due  to  target  plus 
noise,  implying  that  xfe;/?=v6;/?+s(1)6;/?  and  xbj=vb-j+s(l)b.j-  In  each  bin  the  noise  components  are 
normally  distributed  with  zero  mean  and  variance  XA ob,  so  ,  vb.j  ~N( 0,  V20b ) .  Introduce  a  2- 
tuple  variate  yb~[xbB  x^]7.  The  5-tuple  complex  scatter  returns  x  are  converted  to  a  25-tuple  real 
scatter  returns  y,  y=\y[  -y'B]',  whose  pdf  is,  for  s(mJi  s(lwJ]\ 

f{yW\)  =  (2tc)"2s|E>J  ~v'  exp  (-  Vz(y  -  (y  -  s(1)))  .  (2.1) 
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The  covariance  matrix  Sy  2y  =  cov  (y  \  Hx)  =  cov  (y  |  H2)  =  {  aJk },  contains  B*B  submatrices  ajk: 


ct.]k  =  cov(ypyk)  = 


covix.^x^)  cov(xfP,xtJ) 
c°v(Xj.j. ,x^)  cov(xrJ,xkJ) 


cov(Vj^vkJ})  cov(vm,vkJ) 

cov(vjj’vtjd  cov(vV 


(2.2) 


In  this  paper  bold  case  letters  represents  vectors/matrices;  | .  | ,  ||.|| ,  prime  denote  determinant,  norm, 
and  transposition,  respectively. 

Under  hypothesis  H2  target  class  o2  is  present,  we  have  xb  =  (vM+s(2)fc;R)  +  i  (yb.j+s(2)b.J) . 
The  pair  s(2)b;R,s(2)b.j  represents  the  in-phase,  quadrature  returns  to  the  target  co2  at  range  bin  b .  The 
corresponding  multivariate  pdf  is 

/( y  I H2)  =  (2it) ‘“12^1  ‘,/2  exp  ( -  Vi{ y  -  s(2))'  2; 1  ( y  -  s(2)) )  .  (2.3) 

The  identification  of  a  target  embedded  in  noise,  based  on  HRR  radar  returns,  is  thus 
formulated  as  a  dual  hypothesis  testing  problem 

HV  y  =  [  VBJi+S(\)B,R  VBJ+S(\)Bj]  (2.4) 

y  =  [Vl^+5(2)1^  VIJ+S(2)IJ  VB-Jt+S(2)B-Ji  VBJ+S(2)B-j1  ‘ 

Taking  the  natural  logarithm  of  the  ratio  of  (2.1)  to  (2.3)  and  simplifying,  we  have 

Q2(y)  -  Ox(y)  ><  t21.  (2.5) 


The  quadratic  terms  are  Qm(y)  -  (y  ~  s(m));  ( j  -  s(m)) ,  w=l,2 .  The  threshold  t21  is  chosen  for 

a  prespecified  Type  I  error  a,  with  subscripts  2,  1  indicating  it  is  set  for  the  pair 
(hypothesis,  alternative)  =  (H2,  H^) . 

The  decision  rule  is:  if  Qx(y)  that  is  designed  on  the  basis  of  the  presence  of  target  class  Wj 
is  less  than  Q2(y),  the  latter  designed  on  the  basis  of  target  class  co2 ,  by  r21  then  class  u),  matches 
better  to  the  observable  y  and  so  Hx  is  accepted;  if  Ox(y)  exceeds  Q2(y)  then  H2  is  accepted. 
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Under  Hl  the  quadratic  term  Qx(y)  is  close  to  zero  while  02(y)  is  a  large  positive  number. 

This  is  because,  recalling  (2.1),(2.3)  and  the  principle  of  the  likelihood  ratio  test  [8],  /(j|i/j)  is 

designed  to  be  maximum  that  is  attained  when  Qx (y)  is  minimum;  f(y\H2)  is  not  designed  to  be  a 

maximum  and  so  02(y)  is  not  a  minimum.  As  expected,  02(y)  -  Ofy)  is  a  large  positive  value, 

yielding  the  correct  decision  Hx .  By  the  same  argument  for  H2  to  be  true  Q{(y)  is  a  large  positive 

value  while  Q2(y )  is  nearly  zero.  Therefore,  02(y)  -  Qx(y)  is  small,  leading  to  the  correct  decision 

that  H2  is  true.  Consequently,  the  above  classifier  is  effective  in  emphasizing  a  valid  target  and 

suppressing  the  invalid  target  and  noise.  In  fact,  the  quadratic  classifier  is  a  sufficient  statistic  of  the 
likelihood  ratio  test  and  it  is  the  most  powerful  classifier. 

(b)  Unforced  M- aiy  Classifier:  The  binary  classifier  can  be  generalized  to  generate  the  M- ary 
classification,  for  instance,  by  picking  target  class  to;  such  that  0,(y)  <  Qm(y),  m  6  (1,  ~,M}  A  m*l 

[8].  This  approach  does  not  consider  the  situations:  i)  y  may  be  noise  only,  ii)  the  underlying  target 
may  belong  to  an  unknown  class.  The  italized  word  signifies  that  the  corresponding  class  is  untrained 
and  hence  not  in  the  library  set.  AnM-ary  classifier  can  be  implemented  via  a  two-step  process:  ia) 
place  all  QJy)  satisfying  O0(y)  -  Qm(y )  >  t0m(a/M)  into  the  set  C.mffl,  -,  M)\  ib)  decide  on  H0 

if  C  is  an  empty  set,  otherwise  decide  on  Hl  corresponding  to  the  smallest  element  Q^y)  in  the  set. 

The  threshold  x0m(a/M)  is  determined  in  the  learning  period  pertinent  to  class  at  Type  I  error 

a/M  (for  each  subsystem);  and  Q0(y)  =y'lLyxy  presupposes  a  null  feature  vector  (no  target). 

Still  ia)  does  not  cater  to  the  situation  ii)  (unknown  class  of  targets).  This  is  because  the 
feature  vector  for  an  unknown  target  class  is  not  available  and,  therefore,  its  threshold  cannot  be  set. 
This  dilemma  is  bypassed  on  recognizing  that  the  distribution  of  Om( y)  is  known  from  the  learning 

period.  To  see  this,  denote  Qm,Om  the  minimum,  maximum  of  Om(y)  computed  in  the  learning 

interval.  An  M- ary  classifier  with  unforced  decision  thus  may  be  implemented  by: 
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Place  all  triplets  im,  Qm(y),  Qjm(y)),  m  e  { 1,  -,M),  satisfying  the  condition 

0;TO(  j) s  ( Qj(y)  ~  Qm(y ) ) '  ^SalM)  * 1  in  a  set  d  . 

If  Z)  is  not  an  empty  set,  order  the  elements  in  the  set  from  the  smallest  to  the  largest.  Decide  on  Hl 
for  the  smallest  possible  element  Q/y)  (since  it  belongs  to  D  it  naturally  satisfies  the  normalized 
thresholding  condition)  such  that  its  second  component  Ql(y)£  [  Qj,  Ot ) .  Otherwise  decide  on  . 

(2.6) 

Here  iZ6  is  the  composite  hypothesis  pertaining  to  no  target  or  a  target  of  unknown  class.  By 

normalizing  Q}(y)  -  Qm(y)  with  respect  to  the  corresponding  threshold,  Q^y)  -  Qm(y )  can  now 

be  quantified  with  other  members  of  the  class.  Direct  comparison  in  the  ordinary  M-ary  testing  is 
impossible,  owing  to  the  incompatible  thresholds  for  different  pairs  in  simple  hypothesis  testing. 

We  remark  that  an  unforced  classifier  is  desirable  because  it  avoids  making  a  decision 
contingent  on  the  known  target  classes  when  there  is  no  target  or  the  target  does  not  belong  to  the 
known  classes.  Nevertheless,  a  larger  number  of  target  classes  lowers  the  false  alarm  rate  for  each 
of  the  subsystems,  thereby  increasing  their  thresholds  and  decreasing  the  identification  power.  These 
drawbacks  are  inherent  in  both  the  unforced  and  forced  M- ary  systems. 

ID.  SYSTEM  PARAMETER  ESTIMATION 

In  this  section  the  estimation  of  parameters,  necessary  for  the  implementation  of  the  identifier,  is 
considered. 

(a)  Target  Signature  Estimation:  Suppose  N  statistically  independent  samples  ym  are 

collected  during  the  learning  period  and  target  class  u>m  is  present  a  priori  throughout  the  duration. 
The  posteriori  pdf  is  evaluated  from  (2. 1)  as 

f(ym  (2”)'“" !  V  exP  (-  Of.rW'  V  ^r.rV )  ■ 

Taking  the  natural  logarithm  and,  because  of  the  identity  AlxJBJxl  =  trace  (BA),  we  obtain 


12-7 


ln f(ym  ">y\x\ \HJ  = c  -  'AN[n  \%y\  ~ 1/2  Eli trace  (21  (th  "  W  Ow  ~  W;) » (31> 

and  c  is  a  constant. 

Utilizing  the  identity  {dldA)  trace  {A'B)  -  B,  the  derivative  with  respect  to  s(m)  is 
(dlds(mJ  W(y[iy  -ym\HJ  =  ~  2  Ell  V  (j'w'W-  Setting  the  derivative  to  zero  and,  because 
Ey1  is  invertible,  the  maximum  likelihood  estimate  (MLE)  of  is 

\m)  ~  N  1  ^=1^  •  (3 .2) 

Taking  the  derivative  with  respect  to  s(m)  once  more  to  get 
(d2lds2{m))lnf(ym  ",yw\HJ  =  -NE^1;  it  is  a  negative-definite  matrix,  implying  that  s(m)  is  a 
necessary  and  sufficient  estimator  of  s(m) . 

(b)  Noise  Covariance  Matrix  Estimation:  Since  the  covariance  matrix  E^  is  the  same  for  all  target 
classes,  it  suffice  to  estimate  the  covariance  matrix  under  Hx .  Upon  taking  the  derivative  of  (3.1) 
(s(m)  replaced  by  its  estimate)  with  respect  to  hy  and  invoking  the  identities  {dldA)  ln  | A  \  =A~l , 
{dldA)  trace  {A1  II)  =  -A’lBA~l,  then 

-v4jvz;'  V^rW^rW'1;'  (33> 

Setting  it  to  zero,  the  MLE  is 

=  N  1  E«»i  ~  ■*(!))  {y^n-\  ~  \l})'  .  (3.4) 

Thus  2^  is  estimated  by  (3.4)  in  conjunction  with  (3.2).  Furthermore,  making  use  of  data 

samples  from  all  a  posteriori  target  classes  the  estimate  can  be  improved,  in  the  context  of 
convergence  rate,  as 

%  -  <m"  III-,  XT.,  ow -  W (J’r.i  -  <3-5) 
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Because  j>M,  n=\,  •••,  N,  is  Gaussian  distributed,  all  MLE  estimators  developed  hitherto  are 
also  the  minimum  variance  unbiased  estimators  [9]. 

IV.  UNCERTAINTY  CONSIDERATIONS 

In  this  section  some  practical  problems  confronting  a  parametric  HRR  classifier  are  considered. 
These  introduce  additional  layers  of  complexity  into  the  system  thereby  degrading  its  performance. 
Some  solutions  are  suggested  below. 

(a)  Variability  of  View  of  Asnect:  Until  now,  the  view  of  aspect  (<t>,  0)1  of  a  target  is  assumed 
constant.  In  reality  this  is  not  so,  either  because  of  the  imperfection  in  estimation  or  the  high 
maneuverability  of  modem  aircrafts.  This  makes  the  target  identification  problem  difficult.  To  see 
this,  consider  an  aircraft  of  length  L  (nose  to  tail),  width  W  (farthest  distance  between  the  wings), 
height  H,  and  view  of  aspect  (4>0,  0O) .  Suppose  the  aircraft  is  nose-on  to  the  radar.  In  this  case, 

radar  returns  from  the  front  side  on  both  wings  appear  in  the  same  set  of  bins.  The  same  can  be  said 
for  any  subgroup  of  scatterers  equidistant  to  the  radar  line  of  sight.  Suppose  now  the  view  of  aspect 
is  switched  to  (4>1}  0O),  4>^4)0,  then  returns  from  the  front  side  of  both  wings  may  no  longer  be  in 

the  same  bins.  The  same  phenomenology  occurs  when  the  angle  of  elevation  changes.  Apparently, 
a  target  at  different  views  of  aspect  casts  different  range  profiles. 

The  variability  of  the  range  profile  of  a  target  (after  alignment)  as  affected  by  the  view  of 
aspect  often  confuses  a  parametric  classifier;  it  assumes  that  the  embedded  signature  of  the  same 
target  stays  the  same  from  one  profile  to  the  next.  However,  the  shape  of  the  range  profile  does  not 
vary  much  if  the  aspect  angle  changes  are  sufficiently  small.  This  condition  is  assured  if  the  maximal 


1  Imagine  an  aircraft  parked  on  the  equatorial  plane  of  a  sphere  in  such  a  way  that  its  nose  points  to  the  South,  the 
wings  to  the  East- West,  the  intersection  of  the  wings  and  the  fuselage  coincides  with  the  sphere’s  center,  and  the  radar  is  along 
the  South  and  noise-on  to  the  aircraft.  Suppose  the  aircraft  rotates  with  respect  to  the  center.  The  angle  of  azimuth  (J) , 
-180°  <  <j)  <  180°,  is  the  angle  between  the  South  axis  and  the  projection  of  the  fuselage  onto  the  equatorial  plane.  The 
angle  of  elevation  0 ,  -90°  <  0  <  90°  ,  defines  the  angle  between  the  aircraft  fuselage  and  its  projection  on  the  equatorial 
plane.  The  pair  ((j),  0)  constitutes  the  view  of  aspect  of  the  aircraft  with  respect  to  the  radar.  A  segment  of  the  sphere’s 
surface  forms  an  aspect  window. 
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time  difference  among  all  scatter  returns  caused  by  the  aspect  change  is  less  than  the  radar  sampling 
interval.  This  is  equivalent  to  the  situation  that  the  maximal  distance  difference  among  all  compatible 
scatters,  due  to  aspect  angle  changes  (a4>,  a0),  is  less  than  the  radar  resolution  rresol  and  it  is 

satisfied,  for  a(|),a0>O  ,  by2 

Z, A<f> < Vi rresol,  W^<V2rresol,  H L$<V2rresol.  (4.1) 

Given  the  radar  resolution  and  the  target  dimension,  it  becomes  feasible  to  find  the  pair 
(a4>,  a0)  satisfying  (4. 1).  One  thus  can  be  assured  that  the  profile  of  a  target  at  the  nominal  view  of 
aspect  (<J)0,  0O)  is  essentially  the  same  as  any  profiles  of  the  same  target  in  the  aspect  window 

[<t>0-^A<f),  <j>0+'/2A(j>)  X  [0o-'/2A0,  0o  +  '/2A0). 

Suppose  [<j),  ({)] ,  [0,  0]  denote  the  closed  intervals  of  azimuth  and  elevation  angle, 

respectively;  the  underbar,  overbar  represent  the  minimum,  maximum  of  the  associated  parameter. 
In  the  R*R  space,  the  domain  underlying  the  variability  of  <j),  0  is  divided  into  NQ  subregions 

commonly  called  aspect  windows  ( a  |  =  nQ=l,-,NQ },  where  =  |_  (I~£)/a£  J,  £=(j>,0 . 

Aspect  window  an  is  a  2-D  region  in  the  space  of  view  of  aspect,  and 

\„e  =  [  ~ 1 ) ■ *4>, A<j) )  x  [  („g  - 1)  A0,  ne  a0  ) . 

With  this  in  mind,  an  unrestricted  binary  identifier  that  compensates  for  the  variability  of  view 
of  aspect  is  thus  a  restricted  binary  identifier  that  assumes  exact  view  of  aspect  accommodating  all 
the  N^Nq  aspect  windows.  It  is  modified  from  (2.5)  as 

02O;  Ve)  "  2i(t;  Ve)  ^  T2P  ^V1;iVwe^L  (4-2) 

H2 


2Suppose  the  aircraft  is  nose-on  to  the  radar.  At  (j) =0  the  wings  are  perpendicular  to  the  radar  line  of  sight  (RLOS). 
For  <t>=t{)0  the  projection  of  the  wings  to  the  RLOS  is  W  CoS  (90  +4>0) .  When  (f)  =  4>Q  +A({)  the  projection  of  the  wings 
to  the  RLOS  is  IF  Co5  (90 +4>0+A(})) .  The  difference  is  thus  j  PFS7/?((j>0)  -  IFS7/2(4)0+A(f))  |  “  W  A(j).  If  it  is  less 
than  '/2  Y rtsol  then  both  wings’  returns  will  fall  on  same  set  of  bins.  The  factor  Vt.  accounts  for  round-trip  travel.  Other 
incidences  are  derived  in  a  similar  fashion. 
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Correspondingly,  the  quadratic  term  becomes  Qm(y\  ri^nQ)  =  ( y  -  e))/S>,1  (y  -  s(^  6 ),  m- 1,2. 

The  unrestricted  M- ary  identifier,  either  forced  or  unforced,  is  developed  in  an  analogous 


way. 


(b)  Discrepancy  between  Synthetic  and  Measured  Data:  In  the  same  spirit  as  SPICE  that  simulates 
the  response  of  an  electronic  circuitry,  there  exists  some  sophisticated  electromagnetic  codes  capable 
of  predicting  the  range  bin  returns  of  an  aircraft,  given  its  geometrical  and  physical  structures.  The 
government  proprietary  XPATCH  is  a  good  example  [10].  The  output  from  these  codes  are  called 
synthetic  data;  and  the  data  collected  from  a  real  scenario  form  measured  data. 

The  importance  of  synthetic  data  cannot  be  overstated.  For  example,  normal  access  to  foreign 
aircrafts  is  denied.  The  only  option  is  to  feed  in  structural  information  about  the  unknown  aircraft 
(as  obtained  from  Jane ’s  Aircraft,  for  example)  to  the  degree  possible  from  the  predictive  code,  in 
the  hope  that  the  resulting  synthetic  data  resembles  the  measured  data.  Also,  signature  estimation 
of  an  aircraft  in  a  particular  aspect  window  demands  a  large  quantity  of  profiles  of  a  very  fine 
(possibly  granular)  view  of  aspect  which  may  be  too  costly  or  impractical  for  generation. 

As  Fig.l  indicates,  although  the  synthetic  data  resembles  reasonably  well  the  measured  data 
(after  aligning  bin  252  of  the  former  with  bin  3 16  of  the  latter),  such  agreement  is  qualitative  and  not 

B  t  i 

quantitative.  To  compensate  for  the  inequality  of  the  synthetic  data  s(w)  as  compared  to  the 
measured  data  of  aircraft  class  m  at  aspect  window  a  ,  a  constant  vector  c(m*  e)  is  added 


(”*”0)  (rt.rtg)  (nAne)  r_, 

to  the  synthetic  data.  The  problem  reduces  to  finding  c(m)  such  that  s(m)  »  s(m)  +  c{m)  .  The 

mean-square  error  is  e  =  £  ( ( s{m)  -  s(m*  -  c{m)  )  (s{m)  -s(m)  -  c(m)  )},  and  the  expectation 

is  taken  over  all  samples  in  the  aspect  window  a  .  Taking  the  derivative  of  e  with  respect  to 


c(m)”e>  a°d  setting  it  to  zero,  the  estimate  is 
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Here  (or  s”*^)  denotes  the  j_th  measured  (or  synthetic)  data  set  of  target  class  o)m  in  aspect 
window  a  „  . 


(c)  Nonstationaritv  of  Power:  In  general,  the  power  of  the  observation  y  may  not  equal  to  the  power 
of  the  training  data  set  y^  that  were  used  to  estimate  target  signatures.  Also,  the  power  may 

vary  from  observation  to  observation.  This  may  be  caused  by  several  factors.  Firstly,  the  distance 
between  a  radar  and  its  target  may  be  changing.  Because  the  radar  transmitting  power  is  inversely 
proportional  to  the  square  of  the  meradial  distance  [6],  on  the  receiver  side  the  power  of  the  back- 
scattered  signal  after  a  two-way  trip  is  inversely  proportional  to  the  fourth  power  of  the  distance. 
Secondly,  radar  returns  may  be  attenuated  either  by  the  transmitting  media  or  other  natural  or  man¬ 
made  sources.  Lastly,  the  radar  hardware  in  the  field  may  be  different  from  that  used  for  training;  or 
the  radar  may  be  aging.  Consequently,  the  embedded  target  signature  may  not  be  equal  to  any 
member  of  the  library  set  thereby  degrading  the  identifier.  To  resolve  this  difficulty  the  incoming  data 
y  is  multiplied  by  p  for  power  equalization  between  the  data  and  the  training  set,  that  is, 

(Pl/CP y)  =  The  power  equalizing  factor  is 


P  = 


N'] 


y'y 


(4.4) 


Of  course,  both  the  observation  and  the  training  set  are  associated  with  the  same  target  at  a  specified 
aspect  window. 


(d)  Non-constant  Target  Signature  Location:  In  each  measurement  embedded  target  signatures  may 
not  belong  in  the  same  bins.  Alignment  of  the  observation  is  thus  in  demand  because  of  two  reasons. 
Firstly,  during  training  a  target  feature  at  a  particular  bin  it  is  estimated  by  the  mean  of  the 
observables  at  the  same  bin  (after  noise  compensation,  ct.  eq.(3.2)).  A  large  number  of  observables 
results  in  an  efficient  estimate  of  target  signatures.  The  estimate  is  meaningless  if  the  target  features 
embedded  in  the  observation  are  misaligned.  Secondly,  the  optimal  parametric  identification  is 
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results  in  an  efficient  estimate  of  target  signatures.  The  estimate  is  meaningless  if  the  target  features 
embedded  in  the  observation  are  misaligned.  Secondly,  the  optimal  parametric  identification  is 
designed  on  the  basis  that  when  the  observation  contains  a  target  feature  s(m)  the  quadratic  term 

Qm( y)  =  (y  -  s(m^  (y  -  s(m))  attains  a  minimum,  while  other  quadratic  terms  Qfy),  j*m ,  do  not 

(refer  to  (eq.2.5)).3  This  is  because  Om(y)  =  (vm+  s(M)  -  s(m))7  ( vm  +  s(m)  -  s(m))  would  be  noise  only, 

but  will  not  behave  so  when  target  features  embedded  in  y  are  not  aligned  with  respect  to  the  feature 
vector  in  the  library  set. 

One  way  to  align  the  observation  vector  is  to  correlate  y  with  s(m) 

^(u)  =  X!/=l  ^(m)2/T2y-2o+  ^(m)2;>lT2y+l-2u  ’  (4.5) 


and  y  is  zero-padded  for  out-of-bound  elements.  The  u*  that  delivers  the  largest  value  among  all  S2(u) 
is  the  order  of  bin  the  data  profile  needs  to  shift  to  the  left  for  proper  alignment.  The  summand 
underneath  the  summation  sign  contains  two  terms  because  each  bin  consists  of  two  components. 

The  alignment  algorithm  (4.5)  requires  41 3  2  multiplication  operations  ( B  >  600)  and  is  time 
consuming.  A  more  efficient  computationally  method  is  to  choose  L  largest  features  among  s(m)  that 

belong  to  the  target  a  priori ,  and  the  corresponding  bin  numbers  in  set  D  (see  (2.6)).  The  L  pair  of 
entries  are  subsequently  copied  onto  s ^  (same  indexing),  all  other  elements  of  are  assigned  null. 


A  counterpart  of  (4.5)  is 

^(u)  =  X)/6D  \m)2jy 2/-2u  +  S(m)2j+1  1  -2u’  D  c  . 


(4.6) 


Besides  require  only  4  BL  multiplication  operations  (I<50),  a  subtle  advantage  is  that 
because  major  target  features  are  more  immune  to  noise  effect  (higher  signal-to-noise  ratio),  S2(u) 
may  be  less  susceptible  to  noise  as  well  as  the  generation  and  destruction  of  features  arising  from 
minor  changes  in  the  view  of  aspect.  Moreover,  the  nonlinear  filtering  approach  (4.6)  may  perform 


3  For  simplicity,  throughout  this  section  y  is  assumed  statistically  independent.  In  dependent  processes  Jt ,  S(m)  are 
replaced  by  y,  S  ,  respectively. 
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V.  EXPERIMENTAL  RESULTS 

The  operation  of  the  foregoing  identifier  is  summarized  in  Fig.  2.  A  C-based  software  package  was 
coded  to  implement  the  identification  process.  Using  a  Sun  Sparc- 10  workstation,  it  takes  about  120 
minutes  for:  i)  system  parameters  estimation  using  9906  profiles  entailing  three  target  classes  and  18 
aspect  windows,  and  ii)  identification  using  a  second  set  of  9876  profiles.  Most  of  the  computer  time 
is  spent  on  target  training.  If  identification  only  is  carried  out,  i.e.,  the  lower  portion  in  Fig.2,  the  time 
is  about  60  minutes.  In  other  words,  unforced  target  identification,  when  supplied  with  pre-estimated 
system  parameters,  takes  about  60x60/9876  =  0.3645  seconds  per  profile. 

In  particular,  Fig.3  shows  the  performance  of  the  proposed  identifier  in  18  various  aspect 
windows  using  measured  target  signatures.  The  superior  performance  of  the  new  identifier  speaks 
for  itself.  Some  aspect  windows  encounter  low  probability  of  identification.  We  offer  three  reasons. 
Firstly,  signatures  of  different  targets  (at  least  two)  exhibit  sufficient  similarity  in  the  affected 
windows  as  to  confuse  the  identification  system.  Secondly,  in  the  affected  windows  a  very  low 
number  of  profiles  is  available  for  training  (less  than  5).  System  parameters  cannot  be  estimated  and, 
consequently,  any  parametric  identifiers  simply  break  down.  Lastly,  during  the  course  of  evaluation 
it  was  discovered  that  algorithm  (4.5)  misaligns  at  a  rate  of  about  1  per  20.  Alignment  is  critical 
both  to  signature  estimation  and  target  identification  (Section  IV).  A  single  incidence  of 
misalignment  may  cause  the  process  to  break  down,  regardless  of  how  efficient  it  may  be. 

The  same  diagram  also  presents  the  results  when  synthetic  signatures  are  used.  The 
performance  is  satisfactory. 

In  Fig.4  we  also  compare  the  performance  of  the  new  identifier  when  alignment  is  carried  out 
either  by  the  regular  correlation  algorithm  (4.5)  or  by  the  magnitude  correlation: 

^(U)  =  \j  S(m)2j  +  S(m)2jf\  ^2/-2u  +T2/>l-2u  >  0=1, ~,2?. 

The  inferiority  of  the  magnitude  approach  in  both  aspect  windows  implies  that  misalignment 
occurs  even  more  frequently.  This  is  expected.  It  is  well  known  in  basic  communication  theory  that 
a  detector  under-utilizing  data  information  performs  worse  than  its  counterpart  that  uses  the  frill 
knowledge  about  the  data  [8J.  A  good  example  is  incoherent  (envelope)  detection  versus  coherent 
detection.  Regrettably,  target  identification  using  the  envelope  (magnitude)  version  of  (2.6)  has  not 
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a  detector  under-utilizing  data  information  performs  worse  than  its  counterpart  that  uses  the  full 
knowledge  about  the  data  [8],  A  good  example  is  incoherent  (envelope)  detection  versus  coherent 
detection.  Regrettably,  target  identification  using  the  envelope  (magnitude)  version  of  (2.6)  has  not 
been  coded  in  our  software  package  and  hence  direct  comparison  cannot  be  made.  Based  on  the 
above  results,  we  may  infer  loosely  that  the  magnitude  profile  approach,  including  the  power 
transformation,  performs  worse  than  the  new  detector. 

VI.  CONCLUSIONS 

We  have  developed  a  new  methodology  for  identifying  an  aircraft  using  radar  range  profiles  through 
application  of  the  generalized  likelihood  ratio  test.  The  new  identifier  maximizes  the  probability  of 
target  identification,  at  a  prespecified  Type  I  error,  in  dependent  Gaussian  processes,  and  it  identifies 
the  target  without  the  degradation  associated  with  making  forced  decisions.  If  the  new  identifier  is 
subjected  to  more  restrictive  conditions,  it  reduces  to  other  classes  of  identifiers,  including  the 
maximal  cross-correlation  identifier  and,  after  power  transformation,  the  shortest  Euclidean  distance 
identifier.  All  system  parameters  necessary  for  the  implementation  of  the  identifier  are  estimated,  and 
the  process  does  not  require  any  prior  information  about  the  statistical  properties  of  the  measured 
data. 

The  results  of  experiments  with  an  extensive  real  HRR  data  set  demonstrate  that  the  proposed 
identifier  attains  a  reasonable  probability  of  detection  (see  Fig.  3)  and,  moreover,  it  is  amenable  to  real 
time  processing.  It  also  delivers  satisfactory  results  upon  the  replacement  of  measured  target 
signatures  by  a  superposition  of  synthetic  target  signatures  and  compensation  vectors.  This  is  not 
surprising.  The  identifier  (2.6)  processes  an  HRR  profile  based  on  the  Euclidean  distance  (after 
decorrelation).  The  norm  of  a  profile  that  is  effected  from  a  target  signature  remains  distinguishable 
to  the  data  processor  from  the  norm  of  the  same  profile  that  is  subtracted  from  other  target 
signatures,  provided  the  associated  target  signatures  are  separable  from  each  other;  this  is  true 
irrespective  of  whether  the  signatures  are  of  measured  or  synthetic  target  type.  It  also  suggests  that 
target  identification  based  on  magnitude  or  its  variants  (power  transformation  technique,  for 
example),  is  not  recommended  (see  Fig.4). 

These  experiments,  coupled  with  the  theoretical  analyzes,  justifies  the  proposed  technology 
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for  reliable  aircraft  identification.  Further,  target  identification  using  synthetic  signatures  is  feasible 
and  its  implementation  procedure,  except  for  minor  modifications,  parallels  its  counterpart  using 
measured  signatures. 
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Magnitude  (in  logarithm) 


Fig. 2  Schematic  diagram  of  unforced  M-ary  target  identification 
using  measured  HRR  target  signatures 
Note:  The  upper,  lower  half  denote  training,  detection  period,  respectively 

j,  m=l,-,M;  j*m;  V  n*,  ne 


12-18 


Probability  of  unforced  target  identification 


Aspect  window 

Fig.3  Probability  of  unforced  identification  of  target  j,j=A  (•),  B  (■),  C  (A), 
that  is  embedded  in  data  and  in  aspect  window  A„,  n=l,  - ,18 , 
using  measured  (solid  line)  or  synthetic  (dotted  line)  data  signature 
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Probability  of  unforced  target  identification 


Target  class 


Fig.4  Probability  of  unforced  target  identification  with  the  HRR  data  profiles 
aligned  by  in-phase  and  quadrature  correlation  (solid  line)  or  magnitude  correlation  method 
(dotted  line)  using  measured  (circles)  or  synthetic  (rectangles)  target  signatures 
Note:  solid,  hollow  symbols  (circles,  rectangles)  represent  aspect  window  Ah  A2; 

1,2,3  on  x-axis  represent  target  class  A,  B,  C,  respectively 
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Abstract 

The  task  of  a  sensor  manager  is  to  improve  the  performance  of  the  individual  avionics  sensors  by 
coordinating  their  activities  based  on  the  sensor  manager's  best  estimate  of  the  future.  This  report  continues 
the  study  of  scheduling  algorithms  for  the  sensor  manager.  It  is  composed  of  two  individual  reports,  Match 
Maker  and  Metrics.  Match  Maker  offers  a  modification  to  the  crossover  algorithm  of  genetic  algorithms 
that  shows  promise  of  speeding  convergence  of  the  genetic  search.  Metrics  is  a  discussion  of  the  role  the 
choice  of  the  evaluation  function  has  on  the  performance  of  scheduling  algorithms. 
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OF  MATCH  MAKER  AND  METRICS 


Introduction 


Milton  L.  Cone 


The  task  of  a  sensor  manager  is  to  improve  the  performance  of  the  individual  avionics  sensors  by 
coordinating  their  activities  based  on  the  sensor  manager's  best  estimate  of  the  future.  One  way  to  improve 
that  performance  is  develop  schedules  that  make  better  use  of  the  limited  resources  onboard  the  aircraft. 
This  report  continues  the  study  of  scheduling  algorithms  for  the  sensor  manager.  It  is  composed  of  two 
individual  reports,  Match  Maker  and  Metrics. 

Match  Maker  offers  a  modification  to  the  crossover  operator  of  genetic  algorithms  that  might  speed 
convergence  of  the  genetic  search.  The  summer  afforded  limited  time  to  test  Match  Maker.  On  a  six 
machine/six  job  job  shop  scheduling  problem  Match  Maker  showed  statistically  significant  improvement  in 
the  average  number  of  schedules  the  genetic  algorithm  needed  to  find  the  known  optimal  solution  for  the 
minimum  time  to  process  all  tasks.  Experience  on  a  wide  variety  of  problems  is  needed  to  test  the 
robustness  of  Match  Maker. 

Metrics  is  a  discussion  of  the  role  the  choice  of  the  evaluation  function  has  on  the  performance  of 
scheduling  algorithms.  The  paper  concludes  that  a  combination  of  classical  metrics,  while  not  maximizing 
the  performance  of  any  one  metric,  leads  to  schedules  that  produce  a  balanced  result  between  schedules  that 
meet  deadlines  and  those  that  maximize  processor  utilization. 
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Match  Maker 


1.  Introduction 

The  strength  of  genetic  search  is  that  it  can  integrate  many  different  dispatch  techniques  into  its  algorithm 
by  using  different  dispatch  rules  to  generate  some  of  the  schedules  for  the  initial  population  that  starts  the 
genetic  algorithm  (GA)  search.  By  returning  the  best  schedule  found  during  the  search,  the  GA  guarantees 
that  the  final  schedule  is  no  worse  than  the  best  schedule  produced  by  the  dispatch  rules  that  helped  build 
the  initial  population.  GA  seems  unique  among  scheduling  approaches  in  its  ability  to  unify  different 
scheduling  algorithms.  Unfortunately  even  with  good  starting  locations,  the  search  can  take  a  long  time  to 
converge  since  the  search  space,  even  for  simple  problems,  is  huge. 

Another  strength  of  genetic  search  is  that,  by  comparison  with  other  search  techniques  [Grefenstette,  et.  al., 
1991],  the  genetic  algorithm  can  rapidly  (comparatively)  search  large  spaces.  Grefenstette  reports  that 
genetic  search  outperforms  gradient  search  techniques  in  realistic  problems  where  the  constraints  of  the 
gradient  search  are  not  completely  met  and  random  search  in  common  problems  that  involve  discontinuous, 
noisy,  high  dimensional,  and  multimodal  functions.  The  problem  for  this  application  is  that  for  a  real-time 
application,  genetic  search  may  still  be  too  slow.  Ways  to  speed  the  search  are  needed. 

2.  Background 

The  final  report  from  the  1995  AFOSR  extension  program  [Cone,  1995]  looked  at  genetic  search's  ability  to 
solve  one  of  the  standard  job  shop  problems,  the  6x6  JSSP.  In  this  problem  there  are  6  machines  to  be 
scheduled  with  6  jobs.  Each  job  passes  through  each  machine  but  not  in  the  same  order  and  each  job  may 
take  different  times  to  execute  on  the  various  machines.  This  problem  is  similar  to  the  one  expected  in  the 
sensor  manager. 

Attempts  to  improve  the  performance  of  the  genetic  search  were  reported  in  [Cone,  1995],  These  included 
staticly  varying  the  crossover  and  mutation  rates  as  well  as  trying  to  optimize  them  dynamically  during 
execution  of  the  search.  The  parametric  study  of  the  mutation  and  crossover  rates  showed  certain 
combinations  of  rates  performed  better  than  others.  Starting  with  a  seed  of  (123456789)  for  the  random 
number  generator,  the  trial,  x,  in  which  the  optimal  makespan  result  of  55  was  first  found  was  recorded  for 
25  experiments  consisting  of  120  generations  with  50  schedules  (trials)  in  each  generation.  The  average 
number  of  trials  needed  to  find  the  optimal  schedule  was  averaged  over  the  25  experiments  and  that  single 
number  reported  as  T.  T  measures,  on  average,  how  fast  the  GA  finds  the  optimal  solution  for  makespan. 
The  assumption  is  that  better  performing  GA's  for  this  problem  will  also  be  better  performing  for  more 
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complex  problems.  The  smallest  T  was  found  at  a  crossover  rate  of  0.4  and  mutation  rate  0.1.  T  for 
various  mutation  and  crossover  rates  varied  from  892  to  5762.  Recent  investigation  found  a  better  T  of 
735  with  a  crossover  rate  of  0.4  and  a  mutation  rate  of  0.14  for  this  random  seed.  Finding  the  correct 
mutation  and  crossover  rates  is  an  important  step  to  improving  the  performance  of  the  GA. 

An  alternate  random  number  seed  was  also  studied  by  Cone  to  see  if  a  different  random  number  string 
would  make  a  difference.  The  alternate  seed  is  (987654321)  and  is  generally  referred  to  as  the  reverse  seed. 
The  best  result  for  starting  the  process  with  the  reverse  seed  was  593  but  that  result  was  discounted  since 
there  was  no  crossover  in  that  particular  study.  Presumably  the  sequence  of  random  numbers  happened  to 
find  the  optimal  solution  early  in  the  search  an  unusual  number  of  times.  With  random  seed  (123456789) 
the  metric  jumps  to  1223  supporting  the  contention  that  the  593  was  atypical.  The  next  two  best  results  for 
the  reverse  seed  were  816  and  866. 

The  dynamic  adjustment  of  mutation  and  crossover  rates  did  not  show  any  improvement  in  the  metric.  In 
fact,  none  of  the  techniques  tried  were  able  to  beat  892  although  the  moving  window,  which  limits  the 
number  of  genes  over  which  crossover  can  occur,  performed  nearly  as  well.  This  lead  Cone  to  try  a  new 
approach,  match  maker,  to  accelerate  convergence. 

3.  Match  Maker 

Match  maker  is  inspired  by  the  characteristic  of  a  species  to  match  the  better  suited  of  a  species  together.  In 
normal  genetic  search,  chromosomes  are  randomly  mated  together  to  produce  off-spring.  In  match  maker 
the  chromosomes  are  matched  before  mating.  This  is  accomplished  by  rank  ordering  the  chromosomes 
based  on  their  evaluation.  Crossover  is  then  performed  between  the  best  chromosome  and  the  second  best. 
Then  third  and  fourth  are  matched  and  so  on. 

4.  Early  Results 

Match  maker  was  first  tried  on  the  same  problem  studied  in  [Cone,  1995],  Each  study 

consisted  of  25  experiments  where  each  experiment  produced  6000  schedules  (trials).  Table  1 .  shows  a 

comparison  of  average  trial  number,  T.  R  indicates  the  process  started  with  the  reverse  seed. 

While  match  maker  didn't  always  improve  the  standard  GA,  where  it  did  the  effect  is  dramatic.  For 
crossover  equal  to  0.4  and  mutation  equal  to  0.1  which  was  mean  trial  892,  match  maker  was  able  to  find 
the  optimal  makespan  on  average  in  564  trials.  Further  tuning  of  the  mutation  rate  got  the  average  trial 
down  to  405.  While  the  data  may  not  always  be  statistically  significant,  the  data  suggests  that  there  is 
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Crossover 


Mutation  Rate 


Standard  GA 


Match  Maker 


something  positive  happening.  By  comparison,  none  of  the  dynamic  optimization  techniques  tried  before 
really  showed  much  improvement 

The  data  in  Table  1.  is  based  on  25  experiments.  Another  study  was  conducted  with  more  experiments  to 
establish  the  statistical  significance  of  the  data.  The  next  section  reports  the  results  of  this  study. 

4.  Study  of  Statistical  Significance 

In  statistical  decision  theory,  [Spiegel,  1988],  a  hypothesis  is  created  for  the  purpose  of  rejecting  or 
nullifying  it.  The  hypothesis  is  denoted  by  Hq  and  is  often  called  the  null  hypothesis.  An  alternative 
hypothesis  is  any  one  that  differs  from  the  null  hypothesis.  This  hypothesis  is  generally  noted  by  Hj.  If  a 
random  sample  is  drawn  from  a  population  but  the  results  of  the  sample  differ  significantly  from  the  results 
expected  under  the  null  hypothesis,  then  the  hypothesis  can  generally  be  rejected  or  at  least  not  accepted. 
The  procedures  that  enable  one  to  determine  whether  observed  samples  differ  significantly  from  the 
expected  results  are  called  statistical  decision  rules. 
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There  are  two  types  of  errors  associated  with  decision  rules.  A  Type  I  error  occurs  when  the  null 
hypothesis  is  rejected  when  it  should  be  accepted.  In  testing  a  given  hypothesis  the  maximum  probability 
with  which  a  Type  I  error  is  allowed  is  called  the  level  of  significance  and  designated  a.  Usually  an  a  of 
0.05  or  0.01  is  used.  An  a  =  0.05  means  that  there  are  about  5  chances  in  100  of  rejecting  a  hypothesis 
when  it  should  be  accepted.  A  Type  II  error  occurs  when  the  Hq  hypothesis  is  accepted  when  it  should  be 
rejected.  The  probability  of  making  a  Type  II  error  is  usually  denoted  by  p.  Type  I  errors  are  more 
important  if  the  goal  is  to  reject  the  null  hypothesis,  while  Type  II  errors  are  more  important  when  Hq  is  to 
be  accepted.  There  is  a  tradeoff  between  a  and  p.  Increasing  the  likelihood  of  making  a  Type  I  error 
reduces  the  likelihood  of  a  Type  II  error. 

In  order  to  develop  a  more  complete  statistical  picture  of  the  effectiveness  of  match  maker,  331  (331  is  just 
a  convenient  number)  experiments  were  run  to  determine  the  mean  trial  number  in  which  55  was  found.  In 
case  55  was  not  found  in  6000  trials  then  the  total  number  of  trials  was  set  at  6000.  This  has  minimal  effect 
on  the  statistics  as  99.9%  of  the  experiments  reach  55  in  less  than  6000  trials.  Three  combinations  of 
crossover  and  mutation  rates  were  compared.  The  forward  seed  (F)  to  start  the  random  process  was 
(123456789)  and  the  reverse  seed  (R)  was  (987654321).  Means  and  standard  deviations  were  recorded  for 
both  the  standard  genetic  search  and  the  genetic  search  with  match  maker.  Table  2.  records  the  results. 


Crossover 

Mutation 

Seed 

Standard  GA 

Match  Maker 

a 

U 

o 

0.2 

0.1 

F 

952 

793 

618 

719 

0.2 

0.1 

R 

943 

803 

558 

641 

0.4 

0.1 

F 

1029 

909 

541 

526 

0.4 

0.1 

R 

1083 

974 

507 

503 

0.6 

0.1 

F 

1105 

954 

522 

490 

0.6 

0.1 

R 

1224 

1163 

508 

457 

Table  2.  Mean  and  standard  deviation  of  sampling  distributions. 


If  the  null  hypothesis,  Hq,  is  accepted  then  the  two  studies  being  compared  come  from  the  same  population 
and  the  difference  in  means  is  due  to  the  random  nature  of  the  sampling  process.  If  Hq  is  rejected  then 
there  is  a  difference  presumably  due  to  match  maker.  If  the  random  process  has  stabilized  then  forward  and 
reverse  samplings  at  the  same  crossover  and  mutation  rates  should  be  the  same.  The  next  section  presents 
the  results  of  the  analysis  of  this  data. 
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6.  Analysis  of  the  Statistical  Data 


The  following  paragraphs  give  results  of  the  analysis  of  the  data  collected  in  section  5.  The  data  was  tested 
at  the  0.05  significance  level  which  means  that  Hq  is  not  rejected  if  the  normalized  variable  z  is  Izl  <  1 .96 
and  is  rejected  if  Izl  >  1.96  for  a  two-tailed  test.  To  preview  the  results,  each  sample  collected  with  the  same 
crossover  and  mutation  rates  tested  from  the  same  distribution  regardless  of  forward  or  reverse  seed.  This 
indicates  that  the  331  samples  is  enough  to  make  the  process  independent  of  the  starting  seed.  A 
comparison  of  samples  under  identical  conditions,  except  for  match  maker,  shows  that  the  samples  come 
from  very  different  populations  and  that  the  results  are  highly  significant. 

Crossover  0.2  Mutation  0.1 

Comparison  of  forward  and  reverse  seeds  without  match  maker. 

z  =  0.14  Samples  from  same  distribution,  process  independent  of  seed.  Because  the 
means  are  so  close  the  chance  of  accepting  the  null  hypothesis  when  the  means  come 
from  different  populations  is  quite  high.  p  for  z  =  0.14  is  0.9434  when  a  is  0.05.  If  a  is 
relaxed  then  p  can  be  reduced. 

Comparison  of  forward  and  reverse  seeds  with  match  maker. 

z  =  1.15  Samples  from  same  distribution,  process  independent  of  seed  but  the  conclusion 
is  not  as  clear  cut  as  above.  P  is  0.79.  Because  the  separation  of  the  means  is  greater  than 
above  the  chance  of  accepting  the  null  hypothesis  when  it  should  be  rejected  is  less. 

Comparison  of  forward  seeds  with  and  without  match  maker. 

z  =  7.38  Samples  clearly  from  different  distributions.  At  this  level  of 
significance  there  is  very  little  chance  of  error. 

Comparison  of  reverse  seeds  with  and  without  match  maker. 

z  =  6.83  Samples  clearly  from  different  distributions. 

Crossover  0.4  Mutation  0.1 

Comparison  of  forward  and  reverse  seeds  without  match  maker. 

z  =  0.734  Accept  Hq.  The  samples  probably  come  from  the  same 
distribution.  Pis  0.8715. 

Comparison  of  forward  and  reverse  seeds  with  match  maker. 

z  =  0.865  Samples  from  same  distribution,  process  independent  of 
seed.  P  is  0.8619. 

Comparison  of  forward  seeds  with  and  without  match  maker. 

z  =  8.44  Samples  clearly  from  different  distributions.  At  this  level  of 
significance  there  is  very  little  chance  of  error. 
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Comparison  of  reverse  seeds  with  and  without  match  maker. 

z  =  9.56  Samples  clearly  from  different  distributions. 

Crossover  0.6  Mutation  0.1 

Comparison  of  forward  and  reverse  seeds  without  match  maker. 

z  =  1.44  Accept  Hq-  The  samples  probably  come  from  the  same 
distribution  .  (3  is  0.699. 

Comparison  of  forward  and  reverse  seeds  with  match  maker. 

z  =  0.372  Samples  from  same  distribution,  process  independent  of 
seed.  P  is  0.9342. 

Comparison  of  forward  seeds  with  and  without  match  maker. 

z  =  9.89  Samples  clearly  from  different  distributions. 

Comparison  of  reverse  seeds  with  and  without  match  maker. 

z  =  10.4  Samples  clearly  from  different  distributions. 


7.  Conclusions 

It  is  concluded  that  match  maker  has  an  effect,  that  the  results  are  not  statistical  variations  of  the  sampling 
process  and  that  match  maker  favorably  impacts  the  process.  Thus  match  maker  makes  an  important 
improvement  in  the  convergence  rate  for  this  problem.  Match  maker  should  undergo  further  testing  to  see  if 
it  can  also  improve  the  convergence  rate  in  other,  more  difficult  problems.  If  it  can,  then,  match  maker  can 
be  an  important  contribution  to  genetic  algorithms. 
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Metrics 


1.  Introduction 

The  purpose  of  this  paper  is  to  look  at  classic  metrics  to  see  which  are  appropriate  for  evaluating  the  sensor 
manager  s  schedule.  The  metric  dictates  to  some  extent  the  kind  of  scheduler  that  is  appropriate  since  for 
some  metrics  there  are  optimal  dispatch  rules  that  can  be  used  and  for  others  a  heuristic  scheduling 
approach  may  be  known  only  for  certain  metrics.  The  paper  starts  with  an  overview  of  the  sensor  manager 
scheduling  problem.  Next  several  classic  metrics  that  have  been  used  to  develop  schedules  are  summarized. 
An  evaluation  of  some  of  the  metrics  versus  the  sensor  manager’s  scheduling  problem  follows.  Finally, 
examples  showing  the  impact  of  different  metrics  on  the  similar  job  shop  scheduling  problem  are  given, 
followed  by  conclusions. 

2.  The  sensor  manager's  scheduling  problem 

Aspects  of  the  scheduling  problem  have  been  summarized  by  [Popoli,  1992],  He  lists  the  following 
problems  that  the  sensor  manager  might  have  to  deal  with: 

•  There  are  hard  deadline  tasks  that  have  to  be  completed  in  a  narrow  window  if  they  are  to  be  of 
any  value. 

•  There  are  tasks  without  deadlines  that  have  earliest  start  times  before  which  they  can't  begin 
execution. 

•  There  are  differences  in  task  priorities.  Some  tasks  are  very  important,  others  routine.  A  task's 
priority  generally  reflects  the  urgency  of  its  start  time. 

•  Some  tasks  may  be  interruptable,  others  not. 

•  Tasks  may  vary  greatly  in  length. 

•  Tasks  can  become  necessary  or  unnecessary  very  suddenly. 

•  There  may  be  some  uncertainty  in  the  requirements  for  the  task.  For  example,  the  duration  of 
the  task  may  not  be  known  when  the  task  begins  execution. 

Other  problems  not  mentioned  by  Popoli  include: 

•  Sensors  may  need  to  operate  in  parallel.  At  times  one  of  the  parallel  tasks  may  be  a  no-op 
effectively  keeping  one  sensor  from  interfering  with  the  operation  of  another  sensor. 

•  Sensors  may  need  to  operate  in  series  so  that  one  sensor  can  cross  queue  another.  This  task 
requires  sequential  operation  of  two  or  more  sensors  with  possibly  slack  time  between  operations 
to  allow  data  to  be  transferred  between  sensors. 

•  The  plant  may  change  while  the  scheduler  is  operating.  Some  sensors  may  become  inoperable 
requiring  an  emergency  rescheduling. 
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•  Workload  can  be  heavy  requiring  efficient  use  of  all  resources. 


3.  Classic  Metrics 

Unless  noted  these  metrics  come  from  [Penedo,  1995].  Generally  the  goal  of  the  scheduler  is  to  minimize 
the  value  of  these  metrics  (also  called  objective  functions). 

Makespan  (Cmax).  Makespan  is  the  maximum  of  the  completion  times  of  the  jobs  in  the  system.  It  is  the 
time  when  the  last  job  finishes.  By  minimizing  the  makespan  jobs  get  through  the  system  faster  which  leads 
to  better  utilization  of  the  machines. 

Total  weighted  completion  time  (EwjCj).  Completion  time  is  taken  as  the  time  when  a  job  finishes.  It  is 
not  referenced  to  when  the  job  starts.  Total  weighted  completion  time  measures  how  long  it  takes  to 
process  the  n  jobs  that  the  sum  runs  over.  A  longer  weighted  completion  time  means  that  there  is  more 
work-in-process  (WIP)  in  the  plant.  The  weights  account  for  different  inventory  costs  while  the  material  to 
produce  the  products  is  in  the  plant.  The  sum  of  the  completion  times  is  also  known  as  the  flow  time  and 
this  metric  may  also  be  called  the  weighted  flow  time.  A  variation  of  this  metric  involves  squaring  the 
completion  time  and  adding  a  weight  leading  to  the  weighted  sum  of  squared  completion  time  metric. 

Maximum  lateness  (Lmax).  Lateness  is  the  difference  in  time  between  the  time  when  the  job  actually 
completes  and  the  due  date  for  the  completion  of  a  job.  Positive  lateness  means  the  job  finished  late  and 
negative  lateness  means  the  job  finished  early.  The  goal  is  to  minimize  the  lateness  of  the  worst  job. 
Lateness  relates  to  customer  satisfaction  and  is  not  appropriate  when  the  jobs  do  not  have  deadlines.  If  all 
jobs  in  the  problem  have  a  common  deadline,  then  minimizing  maximum  lateness  is  the  same  as  minimizing 
makespan.  Two  variants  of  lateness  are  total  lateness,  the  sum  of  the  lateness  for  each  job,  and  total 
weighted  lateness  which  applies  a  weight  to  each  job’s  lateness. 

Total  weighted  tardiness  (ZwjTj).  Tardiness  is  the  maximum  of  lateness  or  0.  It  recognizes  that  in  some 
cases  there  are  penalties  for  being  late  but  no  benefits  for  being  early.  The  total  weighted  tardiness  sums  the 
tardiness  for  each  job  in  the  n  jobs  being  considered  in  this  planning  cycle.  Jobs  must  have  deadlines  for 
this  metric  to  be  useful.  Other  variations  on  this  same  theme  include  letting  all  of  the  weights  equal  1,  only 
considering  the  maximum  tardiness  or  summing  the  weighted  tardiness  squared  (XwjTj^). 

Weighted  number  of  tardy  jobs  (XwjUj).  Uj  is  either  one  or  zero.  If  the  weights  are  all  one  then  the 
weighted  number  of  tardy  jobs  is  just  the  number  of  tardy  jobs.  This  measure  is  simple  to  collect  and 
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understand  so  is  frequently  used  in  practice.  It  is  a  measure  of  total  customer  satisfaction  but  is  not 
sensitive  to  how  mad  customers  might  be  if  their  jobs  are  very  late. 

Sequence  dependent  setup  times  (Ssyj,).  Sjjj,  is  the  sequence  dependent  setup  time  between  job  j  and  job  k 
for  machine  i.  Minimizing  this  measure  increases  throughput  and  decreases  slack  time  between  jobs. 

Idleness  (Zip.  Idleness  is  the  time  between  the  finish  and  the  start  of  a  job  that  is  not  devoted  to 
processing.  It  includes  the  dead  time  when  the  jobs  are  waiting  to  get  on  a  machine  and  the  sequence 
dependent  setup  times.  It  is  computed  by  taking  the  completion  time  of  a  job  and  subtracting  the  start  time 
and  all  of  the  time  devoted  to  processing.  Idleness  is  also  known  as  slack  time. 

Economic  makespan  (ER^C^-ap).  The  economic  makespan  metric  [Morton  and  Pentico,  1993]  builds 
schedules  that  reflect  the  cost  of  machine  utilization.  A  price,  R^,  is  estimated  for  running  the  machine. 
This  price  is  multiplied  times  the  completion  time,  C^,  of  the  last  job  on  that  machine  minus  the  earliest 
time,  aj,,  that  the  machine  is  available.  This  objective  function  is  good  in  a  heavily  loaded  shop  where  new 
jobs  are  always  coming  in  so  that  the  idle  time  at  the  end  of  the  current  planning  horizon  for  a  particular 
machine  can  be  filled  by  the  new  jobs.  It  is  generally  equivalent  to  minimizing  the  internal  slack  in  the 
system. 

Discounted  total  weighted  completion  time  (Zwj(l-e_r^-p).  In  the  discounted  total  weighted  completion 
time  the  jobs  that  take  longer  to  complete  contribute  a  larger  percentage  of  their  weight,  wj,  to  the  metric. 
As  a  measure,  it  favors  finishing  the  longer  jobs  at  the  cost  of  taking  longer  to  do  the  shorter  jobs. 

Total  weighted  earliness  (ZPjEj^).  [Czerwinski  and  Luh,  1994]  estimate  the  theoretical  part  start  date  by 
backtracking  along  the  critical  path  from  the  due  date.  Earliness  is  defined  as  the  time  ahead  of  the 
theoretical  part  start  date  that  the  part  actually  starts  production.  There  is  no  penalty  for  starting  after  the 
theoretical  start  date.  Squaring  the  earliness  function  causes  the  penalty  to  increase  faster  the  larger 
earliness  becomes. 

Number  of  processors  ( n ).  If  there  are  more  machines  available  then  jobs  then  this  schedule  tries  to 
minimize  n,  the  number  of  machines  used.  This  objective  function  is  usually  associated  with  a  shop  that  has 
several  identical  machines  operating  in  parallel. 
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4.  Discussion  of  individual  metrics 


Makespan  is  probably  the  most  frequently  used  metric  for  schedule  generation.  [Kober,  1994]  has 
suggested  that  some  metric  other  than  makespan  may  have  some  value  for  the  sensor  manager  scheduler. 
[Stankovic,  et.  al.,  1995]  believes  that  makespan  is  important  in  that  it  minimizes  required  system  resources 
but  since  it  does  not  directly  address  deadlines  it  can't  be  a  primary  metric.  Stankovic  maintains  that  for 
real-time  systems  the  sum  of  completion  times  is  not  important  because  it  doesn't  have  a  direct  assessment 
of  timing  properties  like  deadlines  and  periods.  He  sees  value  in  the  weighted  sum  of  completion  times 
since  it  recognizes  that  some  jobs  have  more  value  than  others.  He  also  feels  that  value  and  deadlines  are 
both  important  and  that  many  real-time  scheduling  systems  concentrate  only  on  deadlines. 

Minimizing  maximum  lateness  doesn't  prevent  tasks  from  being  late.  It  is  possible  to  construct  an  example 
that  minimizes  the  maximum  lateness  yet  has  all  tasks  miss  their  deadlines,  [Stankovic,  et.  al.,  1995]. 
Optimizing  the  total  lateness  or  total  weighted  lateness  may  help  but  it  also  might  cause  more  tasks  to  miss 
their  deadlines  while  trying  to  reduce  the  lateness  of  a  very  tardy  task.  Tardiness  has  the  same  problem  as 
lateness.  By  itself  it  isn't  sensitive  to  the  number  of  late  jobs.  For  a  real-time  operation,  being  on  time  is  the 
most  important  measure  of  the  effectiveness  of  a  schedule.  Minimizing  the  number  of  tardy  jobs  or  the 
weighted  number  of  tardy  jobs  helps  correct  this.  However,  since  the  sensor  manager  is  dealing  with  a  task 
set  that  may  contain  many  jobs  without  deadlines,  in  cases  where  processor  utilization  is  equally  important, 
objective  functions  that  are  only  sensitive  to  deadlines  are  not  totally  adequate. 

Minimizing  either  sequence  dependent  setup  times  or  idleness  is  a  direct  measure  of  throughput  through  the 
system.  While  these  metrics  increase  efficiency  they  do  not  directly  assess  a  schedule's  ability  to  meet 
deadlines. 

The  total  weighted  earliness  metric  penalizes  a  job  for  starting  early.  The  presumption  here  is  that  jobs  that 
start  too  early  end  up  staying  in  the  plant  too  long  which  increases  the  work-in-process  (WIP)  costs.  This 
metric  is  generally  used  in  conjunction  with  a  metric  that  senses  completion  time  so  that  the  job  has  a 
window  of  opportunity  in  which  to  move  through  the  plant. 

The  economic  makespan  is  similar  to  idleness  except  that  idleness  is  based  on  how  long  jobs  set  idle 
whereas  economic  makespan  is  based  on  how  long  machines  are  idle.  Like  several  of  the  other  metrics,  it 
has  no  sense  of  which  jobs  need  to  be  completed  to  meet  deadlines.  Plant  efficiency  may  have  to  be 
sacrificed  in  order  to  complete  certain  jobs  on  time.  This  metric  has  the  distinction  of  including  a  weight 
measuring  the  importance  of  each  machine.  For  example,  the  IR  system  may  be  favored  over  the  radar  in 
low  emission  modes  so  the  weight  of  the  IR  sensor  may  be  high  in  certain  periods  of  the  mission. 
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The  discounted  total  weighted  completion  time  evaluation  function  produces  schedules  that  improves  plant 
efficiency  by  scheduling  longer  and  more  important  jobs  first.  Schedules  generated  from  this  metric  are 
similar  to  those  using  the  longest  processing  time  first  dispatch  rule.  The  difference  is  that  the  discounted 
total  weighted  completion  time  metric  includes  slack  time.  This  metric  is  useful  to  the  sensor  manager  only 
if  another  metric  is  included  which  measures  the  ability  of  the  schedule  to  meet  deadlines. 

The  last  metric,  number  of  processors,  is  not  particularly  useful  for  the  sensor  manager.  It  is  appropriate 
when  there  are  several  jobs  and  several  machines  available  to  choose  from,  and  the  jobs  can  run  in  parallel. 
As  this  is  not  the  usual  sensor  manager  architecture  the  number  of  processors  metric  would  not  be 
appropriate. 

For  a  dynamic  scheduling  algorithm  (one  that  schedules  tasks  when  they  arrive  and  the  arrival  times  are 
unknown  in  advance  to  the  scheduler)  the  choice  usually  goes  to  a  metric  for  which  an  optimal  dispatch  rule 
is  known.  These  rules  generally  exist  only  for  standard  plant  models  which  the  sensor  manager  architecture 
will  probably  not  match.  For  example  there  are  dispatch  rules  such  as  "weighted  shortest  processing  time 
first"  when  all  release  dates  and  due  dates  are  0  or  "earliest  due  date  first"  when  all  due  dates  are 
sufficiently  loose  and  spread  out,  that  optimize  the  weighted  tardiness  under  certain  very  restrictive 
conditions.  While  these  dispatch  rules  may  not  be  appropriate  for  the  configuration  the  sensor  manager 
finds  itself  in,  they  can  still  be  used  to  make  schedules  for  the  initial  population. 

The  next  section  considers  the  effect  of  combining  individual  metrics  to  improve  the  performance  of  the 
schedules. 

5.  Evaluation  of  combined  metrics 

Metrics  generally  fall  into  two  classes.  The  first  deals  with  a  schedule's  ability  to  meet  its  deadlines. 
Lateness,  tardiness,  number  of  tardy  jobs  and  their  variants  belong  to  this  class.  The  second  group  of 
objective  functions  measures  some  aspect  of  processor  utilization.  Makespan,  economic  makespan, 
completion  time,  sequence  dependent  setup  times,  earliness,  number  of  processors  and  their  variants  are 
examples.  A  good  metric  should  have  a  measure  from  each  class  if  it  is  to  address  the  needs  of  real-time 
scheduling. 

Penedo  suggests  that  one  particular  combination  of  objectives  used  in  the  process  industry  is  the 
minimization  of  total  weighted  tardiness  and  the  sum  of  sequence-dependent  setup  times.  Weighted 
tardiness  measures  a  company  s  ability  to  maintain  the  quality  of  its  service  while  the  sum  of  sequence- 
dependent  setup  times  indirectly  measures  the  plant's  efficiency  (throughput).  The  relative  importance  of 
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the  two  may  change  over  time.  When  the  workload  is  light,  tardiness  may  be  more  important,  but  when  the 
workload  is  heavy,  plant  efficiency  may  be  more  important. 

[Czerwinski  and  Luh,  1994]  also  use  a  combined  objective  function  in  their  Lagrangian  relaxation 
technique.  This  one  combines  an  earliness  with  a  tardiness  penalty.  These  penalties  together  define  a  time 
interval  in  which  an  operation  can  be  scheduled  with  zero  penalty.  The  objective  of  this  penalty  function  is 
to  ensure  on-time  part  completion  and  low  work-in-process  inventory.  By  penalizing  schedules  that  start 
early,  machine  utilization  is  improved  and  slack  time  is  reduced. 

The  next  two  subsections  evaluate  the  Pinedo  and  Czerwinski/Luh  combined  metrics  ability  to  handle  the 
sensor  manager's  scheduling  problem. 

5.1  Pinedo  pair  discussion 

Schedules  produced  by  the  Pinedo  pair,  total  weighted  tardiness  and  the  sum  of  sequence-dependent  setup 
times  will  be  evaluated  as  to  its  ability  to  solve  the  sensor  manager's  scheduling  problems  of  Section  2. 

Hard  deadlines  that  have  earliest  start  times  use  both  metrics  to  define  an  operating  window  in  which  they 
can  operate.  The  weighting  factor  takes  care  of  any  differences  in  the  tasks’  priorities.  While  the  metrics  do 
not  directly  address  whether  the  tasks  are  interruptable,  any  task  that  is  can  be  tested  against  the  metric  to 
see  if  it  produces  a  better  schedule.  Varying  task  length  does  not  directly  impact  the  metric  evaluation. 
Sudden  changes  in  a  task's  priority  require  a  separate  mechanism  in  the  scheduler  which  is  not  a  function  of 
the  metric.  For  example,  a  pop-up  job  may  be  handled  by  a  separate  reactive  scheduler  [Morton  &  Pentico, 
1993].  The  reactive  scheduler  tries  to  produce  a  feasible  schedule  that  has  a  minimal  impact  on  the  current 
working  schedule.  It  may  not  use  a  metric  at  all  and  may  be  as  simple  as  trying  to  fit  the  task  into  a  space  in 
the  existing  schedule  or  delaying  tasks  on  a  machine  by  enough  to  fit  in  the  pop-up  job.  Reactive 
schedulers  have  to  be  fast.  If  a  new  schedule  can  be  built  in  time  to  handle  the  pop-up  task  using  the  usual 
scheduling  cycle,  then  there  is  no  need  for  a  reactive  scheduler.  The  Pinedo  pair,  while  not  taking  much 
longer  than  any  other  metric,  probably  takes  too  long  to  be  considered  for  reactive  use. 

Uncertainty  in  task  requirements  means  that  there  is  some  information  that  is  needed  for  scheduling  that  is 
not  precisely  known.  Whatever  uncertainty  there  is  gets  mapped  into  uncertainty  in  the  objective  function. 

It  then  becomes  a  matter  for  the  scheduling  algorithm  to  manage  that  uncertainty.  Under  some  conditions 
there  are  some  NP-hard  problems  for  deterministic  models  that  have  known  optimal  solutions  for  stochastic 
models.  One  of  these  results  is  for  the  single  machine  problem  when  all  deadlines  are  equal  and  the  metric 
is  the  sum  of  the  weighted  tardiness.  This  may  be  useful  information  to  the  scheduling  algorithm  whenever 
the  Pinedo  pair  is  chosen. 
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The  next  two  bullets  from  Section  2.  concern  sensor  architecture,  the  modeling  of  the  sensors  in  series  or 
parallel  or  a  combination  of  both  (job  or  flow  shops).  Sensor  architecture  only  affects  the  Pinedo  pair  in 
that  there  are  limited  configurations  and  conditions  for  which  optimal  scheduling  algorithms  are  known.  In 
particular,  one  metric  of  the  Pinedo  pair,  the  sum  of  weighted  tardiness,  is  NP-hard  in  the  strong  sense  even 
for  one  machine.  This  means  that  it  is  unlikely  that  a  general  algorithm  can  be  found  that  gives  an  optimal 
schedule  in  polynomial  time  and  suggests  that  a  heuristic  scheduling  technique  like  genetic  search  or 
Lagrangian  relaxation  is  required  to  develop  good  schedules. 

The  next  problem,  plant  change,  really  doesn't  affect  the  Pinedo  pair.  For  example,  if  one  sensor  goes  off¬ 
line  while  the  schedule  is  operating,  the  scheduler  may  have  to  go  into  a  short  rescheduling  sequence  using 
a  less  demanding  metric.  This  is  the  same  as  letting  the  reactive  scheduler  take  over  for  the  short  term 
before  transitioning  back  into  the  long  range  scheduler. 

Heavy  workload  presents  its  own  problems.  The  ideal  schedule  is  one  that  has  the  plant  working  at  peak 
efficiency.  The  weighted  sum  of  tardiness  is  appropriate  under  heavy  workload  conditions  since  it  allows 
the  most  important  jobs  to  be  scheduled  first  and  less  important  jobs  to  be  left  to  later.  Minimizing  the 
setup  times  reduces  sensor  idle  time  which  helps  throughput  when  the  workload  is  heavy.  The  setup  times 
metric  may  not  be  as  appropriate  as  makespan  or  economic  makespan  which  more  directly  address 
processor  utilization. 

5.2  Czerwinski  and  Luh  discussion 

The  Czerwinski  and  Luh  metric  combines  tardiness  and  earliness  penalties  for  operations.  They  use  the 
following  formula  summed  over  all  jobs: 


Z(wjTj2  +  PjEj2) 


In  their  formulation,  the  squaring  of  the  tardiness  and  earliness  metric  penalizes  the  later  or  earlier  jobs 
more.  This  metric  seems  to  be  superior  to  the  Pinedo  pair  against  the  first  criteria,  hard  deadlines.  While  it 
forces  jobs  into  a  window  of  opportunity  which  is  what  the  first  bullet  requires,  it  also  works  directly  on  the 
second  bullet  which  requires  jobs  to  start  after  a  certain  time.  This  metric  is  superior  to  Pinedo  on  this  point 
as  Pinedo  addresses  the  start  time  only  as  a  second  order  effect.  A  possible  problem  with  the 
Czerwinski/Luh  metric  is  that  if  some  of  the  start  times  can't  be  violated,  this  objective  function  penalizes 
but  does  not  prevent  jobs  from  starting  early.  As  with  Pinedo,  weighting  takes  care  of  differing  task 
priorities. 
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The  Czerwinski  and  Luh  metric  has  no  known  optimal  dispatch  rule  for  any  interruptable  system  (or  for  any 
system  for  that  matter).  This  means  that  a  heuristic  is  required  to  develop  schedules.  Varying  length 
schedules  pose  no  problem  for  the  metric.  Pop-up  jobs  have  to  be  handled  by  a  reactive  scheduler  as  was 
done  for  Pinedo.  Uncertainty  in  the  knowledge  of  start  times  for  due  dates  is  reflected  in  uncertainty  in  the 
evaluation  of  a  schedule.  The  Pinedo  pair  may  have  an  advantage  here  since  the  setup  times  should  be 
determined  in  advance  and  the  type  of  job  to  be  scheduled  has  to  be  known  as  a  minimum.  Thus  it  is  likely 
there  are  less  unknowns  in  the  Pinedo  pair. 

The  other  problems  that  the  scheduler  may  have  to  handle  are  similar  for  both  metric  combinations  with  the 
exception  of  the  last  bullet  which  is  heavy  workload.  By  squaring  the  tardiness  and  earliness  penalties, 
Czerwinski  and  Luh  get  a  tighter  control  of  throughput  and  should  be  able  to  develop  schedules  that  make 
more  efficient  use  of  the  sensors. 

5.3  Recommendation 

Inclusion  of  both  classes  of  metrics  is  needed  to  produce  a  balanced  result  and  so  the  recommendation  is  to 
choose  one  metric  from  each  class  and  build  schedules  using  the  combined  metric.  Which  metrics  to  use 
depend  on  the  details  of  the  avionics  architecture  used  for  the  sensor  suites  and  may  vary  with  the  intended 
mission.  Of  the  two  combined  metrics  considered,  the  Czerwinski  and  Luh  combination  seems  a  better 
choice. 

6.  Comparison  of  Example  Schedules  Generated  by  Different  Metrics 

As  a  simple  demonstration  of  the  impact  of  metrics  on  schedules,  consider  the  6x6  job  shop  scheduling 
problem  (JSSP),  [Muth  &  Thompson,  1963].  In  the  6x6  JSSP  there  are  6  machines  and  6  jobs.  Each  job  is 
processed  on  all  six  machines  but  each  in  a  different  order.  Three  schedules  were  generated  using  different 
evaluation  functions.  The  three  metrics  used  were  makespan,  sum  of  squared  completion  times  and  a 
combination  of  makespan  and  sum  of  squared  completion  times.  An  evaluation  of  55  is  the  known  optimal 
solution  for  makespan  and  is  easily  found  by  the  genetic  algorithm  (GA).  One  of  the  many  schedules  with  a 
makespan  of  55  was  chosen  to  evaluate  the  sum  of  squared  completion  times.  The  sum  of  squared 
completion  times  takes  the  time  when  each  of  the  six  jobs  in  the  6x6  JSSP  completes,  squares  the 
completion  times  and  adds  them.  If  the  due  date  for  each  job  is  0  then  the  completion  date  is  the  same  as 
the  tardiness  and  the  sum  of  squared  completion  times  is  the  same  as  the  sum  of  squared  tardiness.  Table  1. 
lists  the  makespan  and  sum  of  squared  completion  times  for  all  three  schedules.  The  schedules  in  Table  1 . 
are  individual  schedules  arbitrarily  chosen  from  the  list  of  those  schedules  minimizing  their  performance 
metric. 
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Measure  of  Performance 

Schedule  Evaluation  Function 

Makespan 

Squared  Completion 

Combined 

Makespan 

55 

64 

60 

Squared  Completion 

15607 

13191 

13298 

End  Time 

Job  1 

48 

26 

26 

Job  2 

52 

64 

60 

Job  3 

55 

56 

56 

Job  4 

54 

59 

58 

Job  5 

53 

29 

29 

Job  6 

43 

31 

41 

Idle  Time 

108 

68 

73 

Table  1.  Comparison  of  schedules  produced  by  three  different  metrics. 


The  makespan  schedule  found  the  optimal  value  of  makespan  but  at  the  expense  of  the  sum  of  completion 
times  squared.  Looking  at  the  end  times  of  the  jobs,  the  makespan  metric  tends  to  balance  the  completion 
times  of  all  jobs.  There  is  no  premium  for  completing  jobs  early.  On  the  other  hand,  the  sum  of  squared 
completion  times  tries  to  complete  jobs  as  soon  as  possible.  If  the  environment  is  such  that  machines  set 
idle  after  a  job  is  done  waiting  for  the  next  surge,  then  makespan  is  a  better  metric  since  the  last  of  the  jobs 
completes  earlier  and  the  next  sequence  can  start  sooner.  If  there  are  jobs  coming  in  that  could  be 
scheduled  on  a  machine  when  it  is  available  then  the  sum  of  squared  completion  times  finishes  some  jobs 
early  and  frees  the  machines  for  new  jobs.  For  example,  job  1  finishes  at  48  for  the  makespan  schedule,  but 
at  26  for  the  sum  of  squared  completion  times.  If  there  are  new  jobs  that  could  use  machine  1  but  are  not  in 
the  original  schedule,  then  they  could  start  at  26  with  the  squared  completion  times  schedule.  Job  5  shows 
similar  results. 

The  third  metric  combines  makespan  and  squared  completion  times.  The  makespan  is  squared  in  order  to 
get  the  units  of  makespan  the  same  as  squared  completion  times.  The  result  is  a  compromise  between  the 
makespan  and  squared  completion  time  schedules.  Compared  to  the  previous  schedules,  the  makespan  is 
about  half-way  between  and  the  sum  of  squared  completion  times  is  nearly  as  good  as  the  sum  of  squared 
completion  time  schedule. 

Another  way  to  compare  schedules  is  on  the  basis  of  idle  or  slack  time  (defined  as  the  completion  time 
minus  the  actual  total  time  on  all  machines  required  for  a  job).  For  example,  job  1  on  the  makespan 
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schedule  completes  at  48.  If  it  is  scheduled  without  any  other  jobs  it  would  take  26  units  to  complete  by 
going  directly  from  one  processor  to  another.  Thus  job  1  sets  idle  for  22  units  sometime  during  its 
processing.  Table  1.  shows  the  idle  time  for  each  schedule.  For  these  schedules  the  sum  of  squared 
completion  times  produces  a  schedule  that  has  the  machines  setting  idle  for  the  least  time,  although  the 
combined  metric  does  almost  as  well. 

As  another  example,  consider  the  schedules  produced  by  the  dispatch  rules,  shortest  processing  time  (SPT) 
first  and  longest  processing  time  first  (LPT).  In  SPT  the  job  with  shortest  execution  time  which  has  an  open 
machine  and  has  its  previous  task  completed  is  run  next.  LPT  is  the  same  as  SPT  except  the  job  with  the 
longest  processing  time  is  considered  first.  The  SPT  rule  is  optimal  for  a  flow  shop  where  each  of  m 
machines  are  in  series,  every  job  must  be  processed  on  every  machine,  each  machine  takes  the  same  amount 
of  time  to  process  job  j,  and  the  criteria  is  to  minimize  makespan.  LPT  optimizes  makespan  on  m  parallel 
machines  by  assigning  the  m  longest  jobs  to  the  m  machines.  After  that,  whenever  a  machine  is  freed,  the 
longest  unscheduled  job  is  put  on  that  machine.  Table  2.  shows  the  results  of  the  SPT  and  LPT  rules 
applied  to  the  JSSP. 


Table  2.  Schedules  produced  by  dispatch  rules. 

SPT  doesn’t  produce  a  particularly  good  schedule  for  either  of  the  two  metrics  considered  here.  The  LPT 
rule  does  even  worse.  Jobs  take  longer  and  set  idle  more  than  with  any  of  the  previous  schedules.  This 
example  illustrates  the  trouble  with  most  dispatch  rules  when  used  in  situations  different  than  those  in  which 
they  were  designed  to  operate.  Performance  is  generally  poor. 
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7.  Conclusions 


This  paper  has  examined  the  impact  of  metrics  on  schedules.  The  sensor  manager's  scheduling  problem  was 
described.  Several  classic  metrics  were  listed  along  with  the  intended  performance  goal  of  the  metric.  It 
was  suggested  that  metrics  generally  fall  into  two  classes  and  that  a  good  schedule  could  be  built  by  taking  a 
metric  out  of  each  class.  Two  combined  metrics  were  evaluated  as  to  their  suitability  for  the  sensor 
manager  problem.  Examples  of  schedules  built  by  three  different  metrics:  makespan,  sum  of  squared 
completion  times  and  a  combination  of  both  metrics  were  presented.  They  were  evaluated  based  on  the 
makespan  and  sum  of  squared  completion  times  metrics  and  also  by  a  third  metric,  idleness,  that  none  of  the 
schedules  were  designed  to  optimize.  Finally  two  common  dispatch  rules,  SPT  and  LPT,  were  used  to 
produce  schedules  that  were  based  on  the  three  previous  metrics. 

As  expected,  a  schedule  that  is  tuned  to  one  metric  generally  performs  poorly  in  a  different  metric.  Thus 
the  sensor  manager  problem  needs  to  be  carefully  analyzed  so  that  the  right  metric  is  chosen.  The  schedule 
generated  by  the  combined  makespan  and  sum  of  squared  completion  times  was  a  compromise  between  the 
schedules  produced  by  the  individual  metrics.  If  the  due  dates  of  all  jobs  are  considered  to  be  the  beginning 
of  the  scheduling  period,  then  the  sum  of  squared  completion  times  is  the  same  as  squared  tardiness  and  the 
combined  metric  example  is  an  example  of  a  metric  taken  from  each  class.  The  result  is  a  schedule  that 
combines  features  of  each  of  the  individual  metrics  to  produce  a  good  schedule  but  not  one  that  maximizes 
the  performance  of  any  one  individual  metric. 
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OPTICAL  STUDIES  OF  TWO  NOVEL  ELECTRO-EXPLOSIVE  DEVICES 


Robert  R.  Criss 
Assistant  Professor 
Department  of  Physics 
Randolph-Macon  Woman’s  College 

Abstract 

High  speed  streak  photography  and  time  integrated  spectroscopy  were  used  to  characterize  optical 
emissions  from  plasmas  formed  by  two  novel  electro-explosive  devices;  a  semiconductor  bridge  (SCB),  and  a 
semiconductor  junction  igniter  (SJI).  The  spatial  extent  and  expansion  velocity  of  each  plasma  were  determined  as  a 
function  of  time.  An  estimation  of  ion/neutral  temperature  was  made  by  assuming  that  the  identity  of  the 
ion/neutral  was  known  and  that  the  plasma  was  expanding  freely  in  the  plane  parallel  to  the  plane  of  the  device.  Time 
integrated  spectra  of  the  plasmas  were  complex  and  analysis  is  not  complete  at  this  time. 
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OPTICAL  STUDIES  OF  TWO  NOVEL  ELECTRO-EXPLOSIVE  DEVICES 

Robert  R.  Criss 


Introduction 

Semiconductor  bridges  (SCB’s)  and  semiconductor  junction  igniter’s  (SJI’s)  are  electro-explosive  devices 
used  to  provide  ignition  of  energetic  compounds  such  as  pyrotechnics  and  propellants.  They  can  be  used  in 
products  ranging  from  automotive  airbags  to  ordinance  systems.  The  detailed  structure  of  these  devices  has  been 
published  elsewhere  and  will  not  be  discussed  in  this  report1 2  A  current  pulse  applied  to  the  SCB/SJI  quickly  heats 
the  device,  producing  an  ionized  plasma  which  is  used  to  ignite  an  energetic  compound.  Though  the  exact 
mechanisms  responsible  for  ignition  are  unknown,  one  might  reasonably  expect  them  to  be  related  to  plasma 
properties  such  as  the  identity  of  species,  temperature,  density,  expansion  velocity,  and  amount  of  ionization.  A 
study  of  optical  emissions  from  the  plasma,  taken  in  conjunction  with  current-voltage  measurements,  offers  the 
promise  of  determining  all  of  the  aforementioned  plasma  properties.  Our  optical  studies  consisted  of  high  speed 
streak  photography  and  time  integrated,  multichannel  emission  spectroscopy.  A  preliminary  analysis  of  the  results  is 
presented  in  this  report 
Methodology 

The  procedures  used  to  collect  data  were  the  same  for  both  SCB  and  SJI  devices.  All  of  the  devices  were 
fired  with  a  current  pulse  from  a  0.2/iF  capacitor  charged  to  lkV.  Multiple  SCB’s  (and  SJI’s)  were  necessary  to  collect 
a  complete  data  set  Since  the  I-V  measurements  from  all  of  the  SCB’s  were  nearly  identical,  it  has  been  assumed 
that  the  plasma  properties  were  reproducible  from  shot  to  shot  This  was  also  the  case  for  the  SJI’s. 

A  bi-convex  lens  was  used  to  focus  an  image  of  the  device  onto  the  entrance  slit  of  the  streak  camera.  Streak 
pictures  ( covering  a  2n&  time  interval)  of  the  luminous  plasma  were  taken  with  the  plane  of  the  device  parallel  to  the 
plane  of  the  camera  entrance  slit  (front  view)  and  with  the  plane  of  the  device  perpendicular  to  the  plane  of  the 
camera  entrance  slit  (side  view).  Intensity  contours  from  pictures  taken  with  the  device  in  each  orientation 
determine  the  location  of  the  luminous  plasma  as  a  function  of  time.  An  intensity  contour  from  the  front  view 
pictures  was  used  to  determine  the  extent  of  the  luminous  plasma  in  the  plane  parallel  to  the  plane  of  the  device  as  a 
function  of  time.  When  this  location  is  plotted  against  the  time,  the  slope  of  the  curve  yields  the  parallel  expansion 
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velocity  (this  is  the  velocity  of  the  plasma  in  the  plane  parallel  to  the  plane  of  the  device).  Likewise,  pictures  taken 
from  the  side  view  yield,  as  a  function  of  time,  the  extent  of  the  luminous  plasma  in  the  plane  perpendicular  to  the 
device  and  the  perpendicular  expansion  velocity. 

Spectra  were  collected  with  a  1 -dimensional,  computer  interfaced  diode  array  coupled  to  the  exit  of  a 
grating  spectrometer.  Each  device  was  aligned  with  the  plane  of  the  device  parallel  to  the  plane  of  the  spectrometer 
entrance  slit  Spectra  were  collected  from  200nm  to  700nm  in  intervals  of  approximately  lOOnm. 

Results 

Typical  front  and  side  view  streak  pictures  of  the  plasma  formed  by  an  SCB  are  shown  in  figure  1.  A  contour 
was  traced  along  the  edge  of  each  picture  and,  from  this  contour,  data  points  were  taken  for  the  plots  of  the  spatial 
extent  of  the  SCB  plasma  as  a  function  of  time,  shown  in  figure  2.  Sixth  order  polynomials  were  fit  to  each  curve. 

The  derivatives  were  then  used  to  generate  plots  of  the  instantaneous  expansion  velocities  as  a  function  of  time, 
shown  in  figure  3.  If  the  expansion  of  the  plasma  is  assumed  to  be  a  free  expansion,  and  if  the  identity  of  the  plasma 
species  is  known,  the  equipartition  of  energy  theorem  can  be  used  to  estimate  the  plasma  ion/neutral  temperature. 
Under  these  conditions,  the  temperature  can  be  written  as 

T « mv2/3k  .  (1) 

where  T  is  the  temperature,  m  is  the  mass  of  the  plasma  species,  k  is  Boltzmann’s  constant,  and  v  is  the  velocity. 
Aluminum  and  silicon  are  the  species  present  in  the  plasma  (this  conclusion  can  be  drawn  from  knowledge  of  device 
construction  and  confirmed  spectroscopically).  The  condition  of  ’free  expansion’  would  most  likely  be  met  for 
plasma  expansion  in  the  plane  parallel  to  the  plane  of  the  device  and  at  some  time  after  the  initial  burst  of  plasma 
formation.  Using  the  parallel  expansion  velocity  of  the  SCB  plasma  at  a  time  0.2  /as  after  the  initial  burst  yields  a 
temperature  of  693  K  for  an  aluminum  plasma  and  720  K  for  a  silicon  plasma.  A  volume  of  0.09  mm3  was 
calculated  for  the  plasma  at  this  time(0.2  /as)  by  using  the  equation  for  the  volume  of  a  right  circular  cone  and 
dimensions  taken  from  figure  2. 

Typical  front  and  side  view  streak  pictures  of  the  plasma  formed  by  an  SJI  are  shown  in  figure  4.  Plots  of  the 
spatial  extent  of  the  SJI  plasma  and  the  instantaneous  expansion  velocities,  generated  by  the  procedure  outlined 
above,  are  shown  in  figures  5  and  6.  Substituting  the  parallel  expansion  velocity  of  the  SJI  plasma  at  a  time  0.2  /as 
after  the  initial  burst  into  equation  (1)  yields  a  temperature  of  1309  K  for  an  aluminum  plasma  and  1362  K  for  a 
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silicon  plasma.  A  volume  of  0.19  mm3  was  calculated  for  the  plasma  at  this  time(0.2  (is)  by  using  the  equation  for 
the  volume  of  a  right  circular  cone  and  dimensions  taken  from  figure  5. 

Spectra  collected  from  both  devices  should  confirm  the  identification  of  species  in  the  plasma  and  might  be 
suitable  for  determination  of  the  plasma  electron  temperature.  The  analysis  has  not  been  completed  at  this  time  but 
will  be  presented  for  publication  at  a  later  date.  Values  for  the  resistivity,  calculated  from  the  current-voltage  traces, 
might  also  be  used  in  conjunction  with  this  temperature  to  determine  the  fractional  ionization  of  the  plasma. 
Summary  &  Conclusions 

Streak  photography  has  been  used  to  determine  the  location  and  expansion  velocity  of  luminous  plasmas 
from  SCB  and  SJI  devices.  Expansion  velocities  were  measured  both  parallel  and  perpendicular  to  the  plane  of  the 
device.  In  each  case,  the  velocity  started  at  a  maximum  and  decreased  most  sharply  in  the  first  O.ljus  of  the  plasma 
discharge.  For  both  devices,  the  initial  parallel  expansion  velocity  was  slightly  larger  than  the  initial  perpendicular 
expansion  velocity.  This  may  be  due  to  shock  effects  or  errors  in  alignment  of  the  device  with  the  streak  camera  slit 
The  effect  of  contrast  enhancement  of  the  streak  pictures  on  the  results  of  the  image  analysis  also  needs  to  be 
considered.  Further  data  analysis,  which  includes  the  consideration  of  shock  effects,  should  resolve  questions  about 
the  validity  of  the  temperature  measurement  discussed  in  this  report 

Though  not  presented  in  this  report,  analysis  of  the  spectral  data  is  ongoing.  A  tentative  summary  of  those 
results  is  presented  in  this  report  Spectral  emissions  from  aluminum  and  silicon  have  been  identified.  It  seems 
possible  that  at  least  some  of  the  features  appearing  in  the  spectra  are  from  the  air  surrounding  the  plasma  discharge. 
Calculations  of  the  plasma  density  as  a  function  of  time  should  suggest  at  what  stage  of  the  discharge  features  from 
the  surrounding  air  would  be  expected  to  appear.  Spectra  collected  when  the  device  is  in  a  vacuum  could  also  assist 
in  the  identification  of  spectral  peaks.  It  should  be  noted  that  all  of  the  spectra  collected  were  integrated,  in  time, 
over  the  full  duration  of  the  plasma  discharge. 

We  are  still  at  too  early  a  stage  of  data  analysis  to  compare  the  results  from  the  measurements  on  SCB 
plasmas  to  those  on  SJI  plasmas.  Indeed,  most  of  the  analysis  not  yet  complete.  An  ion/neutral  temperature  and  an 
estimation  of  plasma  density  should  be  possible  from  the  streak  pictures,  used  in  conjunction  with  knowledge  of 
device  geometry  and  some  reasonable  assumptions.  The  spectra  have  already  confirmed  the  presence  of  aluminum 
and  silicon  in  the  plasma.  It  might  be  possible  to  calculate  a  temperature,  approximating  the  electron  temperature, 
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from  the  spectra.  If  so,  this  temperature  could  be  used  in  conjunction  with  the  plasma  resistivity  (taken  from  the  I-V 
curves)  to  estimate  the  fractional  ionization  of  the  plasma.  The  complete  analysis  will  be  submitted  for  publication 
in  professional  journals  at  a  later  date. 
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Abstract 

The  production  of  metallic  materials  with  controlled  degrees  of  anisotropy  is  important 
because  the  controlled  texture  provides  significant  assurance  that  subsequent  plastic 
deformation  can  be  performed  successfully  and  reproducibly.  To  proceed  to  this  condition, 
the  degree  of  anisotropy  must  be  quantified  by  the  experimental  determination  of  x-ray  pole 
figures  and  the  orientation  distribution  functions  (ODF)  calculated  from  the  pole  figure  data. 
To  insure  the  intensity  data  collected  is  reliable  the  grain  size  of  the  specimen  must  be  suitably 
small.  The  objective  of  this  research  was  to  determine  the  grain  size  error  in  measurement  of 
pole  figures  on  the  x-ray  diffractometer  at  Eglin  AFB. 
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GRAIN  SIZE  EFFECTS  IN  THE  DETERMINATION 


OF 

X-RAY  POLE  FIGURES 
AND 

ORIENTATION  DISTRIBUTION  FUNCTIONS 


Robert  J.  De  Angelis 


Introduction 

Since  the  1980's  the  description  of  material  textures  or  crystal  orientations  in  polycrystalline 
wires  and  sheets  started  to  move  beyond  the  x-ray  determined  pole  figures,  the  norm  since  the 
1930’ s.  The  orientation  distribution  function  (ODF)  has  recently  become  the  method  of  choice 
for  presenting  the  description  of  material  textures.  This  change  was  driven  by  the  need  for  a 
more  complete  method  of  describing  and  quantifying  textures  in  the  tailor  making  of  materials 
and  was  made  possible  by  the  advances  in  computer  speed  and  capacity.  The  ODF  is 
computed  from  pole  density  data  usually  represented  on  pole  figure  plots  which  give  the 
density  of  poles  (normals  to  a  crystallographic  plane)  of  a  specific  {hid}  plane  in  units  of 
multiples  of  the  random  pole  density  for  that  particular  {hkl}  pole.  Pole  figures  are  usually 
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represented  on  two  dimensional  polar  coordinate  plots  where  the  center  of  the  pole  figure  is  the 
normal  to  the  sheet  surface  and  the  direction  to  the  right  is  parallel  to  the  rolling  direction. 
Therefore  it  is  clear  that  the  pole  figure  gives  the  position  of  the  orientation  of  the  specified 
{hid}  poles  relative  to  and  axial  system  based  on  the  sample;  e.g.  rolling  plane  and  rolling 
direction  in  the  case  of  sheet  textures  or  wire  axis  direction  in  the  case  of  wire  textures.  The 
ODF  gives  the  statistical  distribution  of  crystallites  in  a  small  range  of  orientations  plotted  on 
an  angular  axial  system  related  to  the  crystallographic  axis.  A  great  advantage  to  the  ODF 
method  of  texture  representation  is  that  the  coefficients  of  the  harmonic  equations  employed  to 
describe  the  function  provide  weighing  factors  for  the  determination  of  the  anisotropic  elastic 
and  plastic  properties  of  the  material.  These  coefficients  also  are  sensitive  to  orientation 
changes  that  occur  during  certain  solid  state  processes,  such  as;  phase  transformations,  plastic 
deformation  and  reciystallization. 

However  there  has  been  little  concern  over  the  errors  in  measurement  when 
determining  pole  figures.  There  exist  two  contributors  which  determine  the  statistical 
relevance  of  pole  figure  measurements;  the  number  of  grains  irradiated  in  the  diffraction 
experiment  and  the  photon  detection.  The  main  objective  of  this  work  is  to  determine  the 
grain  size  error  in  pole  figure  determinations. 

Problem  Description: 

There  is  a  statistical  error  based  on  the  number  of  grain  diffracting  associated  with  the 
determination  of  a  texture  pole  figure.  In  a  fixed  geometry  x-ray  experiment  the  number  of 
grains  in  the  beam  will  decrease  inversely  with  the  grain  size  of  the  diffracting  material.  The 
error  due  to  the  number  of  grains  diffracting  is  very  similar  to  the  statistical  error  in  counting 


15-4 


the  number  of  pulses  arriving  at  a  photon  detector.  Obviously  the  determination  of  pole 
densities  is  a  statistical  process.  Initially  let  us  consider  the  statistical  error  due  to  photon 
counting. 

If  many  measurements  of  the  number  of  photons,  n,  arriving  at  a  fixed  position  in  a 
defined  period  of  time  are  made.  The  number  n  will  fluctuate  about  an  average  value  of 
counts,  <n> .  For  a  great  enough  number  of  counts  (n  >  100)  the  distribution  of  counts 
observed  follow  a  Gaussian  distribution  with  a  standard  deviation  of  (n)1/2.  Therefore  the 
counting  error  for  one  standard  deviation  is  given  by  (n)1/2/n  or  (n)',/2.  For  example:  If  <n> 
is  100  counts  the  one  standard  deviation  error  is  0. 1  or  10  counts.  That  is,  68%  of  the  time 
the  counts  observed  will  be  between  90  and  110  counts.  In  practice  the  statistical  counting 
error  is  estimated  from  only  one  count. 

Now  turning  to  the  error  associated  with  the  number  of  grains  diffracting  these  same 
concepts  are  directly  applicable;  i.e.  the  one  standard  deviation  error  in  measurement  is 
(Ng) m,  where  Ng  is  the  number  of  grains  diffracting  at  a  fixed  position.  Except  for  a  ideal 
random  sample  the  value  of  Ng  varies  depending  on  measurement  position  due  to  type  and 
strength  of  the  texture  existing  in  the  material.  The  most  direct  method  to  account  for  these 
variations  and  evaluate  the  grain  number  error  is  by  considering  the  reciprocal  lattice  geometry 
of  the  powder  diffraction  process. 

Consider  the  total  irradiated  volume  of  the  specimen  having  a  total  of  N  randomly 
oriented  crystallites  with  an  average  volume  of  dv.  Consider  a  set  of  (hkl)  planes  oriented  for 
diffraction.  The  reciprocal  lattice  is  a  sphere  of  radius  r,  where  the  magnitude  of  r  is  l/d^. 
The  sphere  is  the  locus  of  all  normals  to  the  (hkl)  planes  from  all  the  crystallites  in  the 
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irradiated  volume.  The  Ewald  sphere  intersects  the  reciprocal  lattice  sphere  along  a  circular 
line  of  radius  (r  cos  (rc/2  -  0),  where  0  is  the  perfect  Bragg  angle  for  the  planes  (hkl)  (see  Fig. 
1).  Very  few  normals,  if  any,  that  fall  on  the  line  will  produce  crystallites  oriented  for  Bragg 
diffraction.  In  a  horizontal  diffraction  experiment  the  incident  beam  has  a  vertical  divergence 
of  cc.  This  divergence  increases  the  contact  area  of  the  Ewald  sphere  and  the  reciprocal  lattice 
sphere  from  a  line  to  a  band  of  width  da,  as  shown  in  Fig.  2.  This  immensely  increases  the 
number  of  crystallites  diffracting  (i.e.  having  normals  in  the  band  on  the  reciprocal  lattice 
sphere).  The  fraction  of  crystallites  diffracting  at  any  position  is  given  by  the  ratio  of  the  area 
of  the  band  on  the  reciprocal  lattice  sphere  to  area  of  the  total  sphere;  or: 


Ng  _  27tr2cos(9-g) 
N  4nr2 


da  =  —  cos  Qda 
2 


(1) 


Introducing  the  multiplicity,  m,  of  the  (hkl)  plane  gives: 

N  - 

— f  =  — cos  0  c/ a 
N  2 


(2) 


In  most  metallic  materials  the  diffraction  process  takes  place  in  a  specimen  depth  of  dimension 
much  less  than  the  grain  size.  Therefore  it  is  very  reasonable  to  employ  an  area  rather  than  a 
volume  consideration  in  arriving  at  the  number  of  grains  available  for  diffraction.  If  a  Pisson- 


Voronoi  mosaic  basis  is  assumed  for  describing  the  grain  size 
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distribution  with  a  mean  diameter  d,  then  the  average  area,  Ad,  of  a  spherical  grain  exposed  to 
the  beam  on  a  flat  surface  is: 

Ad  =  d2/3.om  (3) 


Now: 


N  -  AjJAd 


(4) 


where  As  ,  is  the  area  of  the  specimen  covered  by  the  incident  beam.  Then  the  number  of 


crystallites  diffracting  is  given  by: 


V 


mAj  cos  d  da 


(5) 


The  diffraction  spots  from  these  grains  are  spread  over  a  cone  of  diffraction  with  an  apex  angle 
of  40  and  having  a  length  of  R,  the  distance  from  the  specimen  to  the  detector.  The  perimeter 

of  the  circle  containing  these  diffraction  spots  is  2  nR  sin  20  at  the  position  of  the  detector. 

The  fraction  of  the  diffraction  cone  intercepted  by  the  detector  of  length  C  is: 

l/4TtRsm0  (6) 
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Giving  the  number  of  grains  diffracting  at  a  given  position  of: 


mtA,  da 

Nj  = - ± - 

g  AtiAjR  sin0 


(7) 


Results: 

To  determine  the  number  of  grains  diffracting  in  a  typical  pole  figure  determination 
experiment  on  a  Siemens  5000  diffractometer,  the  following  values  for  the  quantities  in  Eq(7) 
were  employed: 

_  height  of  la  at  the  specimen  0  1  cm 
“  R  =  =  003 


0  =  22.8° 


m  =  multiplicity  =  12 
?  =  1  cm 

A,  =  1  cm  x  0.2  cm  =  0.2  cm 2 


Ad  =  d2/3.0988 
R  =  30  cm 
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Inserting  these  quantities  gives: 


jj,  _  1.69  x  10*4  cm  _  l^xlO*4^)2 

■  *  "  d 2  "  d 2  (8) 

This  result  is  shown  graphically  in  Fig.  3  for  grain  sizes  up  to  100  microns.  As  can  be 
determined  from  Fig.  3  a  grain  size  of  50  /im  has  a  statistical  error  of  0.39,  meaning  there  is  a 
68%  probability  that  any  measurement  from  this  specimen  where;  N  is  24.8  x  103  grains,  Nt  is 

914  grains  and  Ng'  is  6.25  grains,  will  be  from  grains  ranging  in  number  from  8.95  to 

3.81.  Another  way  to  look  at  this  is,  this  specimen  could  have  pole  densities  of  between  1.39 
to  0.69  times  random. 
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FIGURE  CAPTIONS: 

Figure  1.  X-ray  Powder  Diffraction  Geometry  in  Reciprocal  Space. 

Figure  2.  The  Area  Sweep  Out  on  the  Reciprocal  Lattice  Sphere  Due  to  Incident  Beam 
Divergence. 

Figure  3.  One  Standard  Deviation  Error  in  Measurement  as  a  Function  of  Grian  Size. 
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INVESTIGATION  OF  PHOTOLUMINESCENCE  INTENSITY 
SATURATION  AND  DECAY, 

AND  NONLINEAR  OPTICAL  DEVICES 
IN  SEMICONDUCTOR  STRUCTURES 

Yujie  J.  Ding 
Assistant  Professor 
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Bowling  Green  State  University 

Abstract 

We  have  observed  saturation  of  photoluminescence  peak  at  low  pump  intensities  in 
growth-interrupted  asymmetric-coupled  Quantum-well  structure.  We  believe  the  saturation  is 
due  to  filling  of  the  exciton  states  localized  at  the  interface  islands.  We  have  observed 
increase  of  the  photoluminescence  decay  time  as  pump  intensity  increases  in  the  same  struc¬ 
ture. 

Based  on  our  design,  a  new  multilayer  structure  was  grown  for  demonstrating 
transversely-pumped  counter-propagating  optica!  parametric  oscillation  and  amplification,  and 
achieving  surface-emitting  sum-frequency  generation  in  a  vertical  cavity. 

We  have  attempted  to  mode-lock  Ti:Sapphire  laser  pumped  by  an  Argon  laser.  We  con¬ 
clude  that  stability  of  the  Argon  laser  is  crucial  for  achieving  stable  mode-locking. 
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INVESTIGATION  OF  PHOTOLUMINESCENCE  INTENSITY 
SATURATION  AND  DECAY, 

AND  NONLINEAR  OPTICAL  DEVICES 
IN  SEMICONDUCTOR  STRUCTURES 


Yujie  J.  Ding 


1.  Introduction 

Recently,  it  has  been  shown  that  by  interrupting  sample  growth  at  every  interface,  one 
can  obtain  multiple  photoluminescence  (PL)  peaks  with  separate  emission  energies  that 
correspond  to  the  excitonic  emissions  at  interface  islands  of  different  sizes  |1-4|.  Because  of 
the  formation  of  these  interface  islands,  the  well  widths  at  these  islands  generally  differ  by 
one  monolayer  with  respect  to  the  designed  width  in  high  quality  samples  15].  However,  the 
area  ratios  among  all  these  islands  of  different  well  widths  are  random,  which  cannot  be  con¬ 
trolled  in  growth  process.  (Without  the  growth  interruption,  the  recombination  of  the  carriers 
in  the  wells  with  different  widths  results  in  the  inhomogeneous  broadening  in  the  PL  spec¬ 
trum.)  At  low  temperatures,  all  the  carriers  generated  by  the  pump  will  be  eventually  relaxed 
down  to  the  lowest  energy  levels  and  localized  in  the  islands,  resulting  in  very  large  carrier 
densities.  If  the  total  area  of  the  islands  is  small,  it  would  be  possible  to  completely  fill  exci- 
ton  states  in  these  islands  at  relatively  low  intensities  that  may  manifest  as  the  saturation  of 
the  PL  peaks.  It  is  worth  noting  that  in  growth-interrupted  samples.  Band-filling  effects  are 
spatially-localized  effects,  due  to  additional  confinement  along  the  interface,  similar  to  situa¬ 
tion  in  quantum  dots  (i.e.  all  the  islands  are  spatially  isolated). 

Recently,  surface-emitting  green  light  was  obtained  [6]  by  frequency-doubling  infrared 
laser  beam  (1.06  pm)  in  the  waveguide  based  on  periodically  modulated  second-order  suscep¬ 
tibility  in  alternating  AI^Ga^As  and  AlvGaj_vAs  (x  *  v )  layers.  When  the  multilayers  are 
sandwiched  between  two  quarter-wave  stacks,  large  increase  in  the  conversion  efficiency  was 
observed  [7]  though  quasi  phase-matching  was  not  established.  Following  Ref.  |8j,  second- 
order  susceptibility  of  asymmetric-coupled  quantum-well  (QW)  domain  structures  were  meas¬ 
ured  in  the  surface-emitting  geometry  |9],  The  maximum  conversion  efficiency  so  far  is  still 
less  than  1%/W.  Recently,  we  proposed  a  novel  practical  scheme  for  implementation  of  the 
cascaded  nonlinearity  using  surface-emitting  second-harmonic  generation  (SHG)  in  the 
Fabry-Perot  cavity.  We  have  shown  that  such  scheme  can  be  efficiently  used  for  optical 
power  limiting  and  optical  phase  conjugation  at  low  input  power  [10|.  Most  recently  [11], 
we  propose  to  achieve  nearly  100%  conversion  efficiency  of  SHG  for  the  low  input  power, 
by  combining  quasi  phase-matching  and  cavity  enhancements  in  semiconductor  multilayers  or 


16-3 


asymmetric  QW  domain  structures.  Thus,  our  investigation  leads  to  the  implementation  of 
practical  frequency  doublers  which  can  cover  the  range  from  blue  to  infrared.  More  impor¬ 
tantly,  we  proposed  to  implement  tunable  optical  parametric  oscillators  (OPOs)  and  amplifiers 
[12]  based  on  a  novel  configuration.  Frequency  doublers,  optical  parametric  oscillators  and 
amplifiers,  and  the  nonlinear  optical  devices  based  on  the  cascaded  second-order  nonlineari¬ 
ties  have  potential  applications  in  generation  of  blue  light,  generation  and  amplification  of 
tunable  mid-IR  light,  optical  communication,  ultrafast  detection,  sensor  protection,  real-time 
holography,  or  optical  lithography. 

2.  Discussion  of  Problem 

a.  Saturation  of  photoluminescence  peak 

Previously,  we  observed  |13]  saturation  of  photoluminescence  peaks.  We  believe  that  it 
is  due  to  band-filling  effects  at  the  interface  islands  as  a  result  of  the  growth  interruption. 
The  intensities  required  to  observe  the  saturation  reflect  the  total  area  of  the  interface  islands, 
thus  the  interface  roughness. 

b.  Photoluminescence  decay 

To  determine  the  characteristic  carrier  density  for  completely  filling  the  interface  islands, 
one  needs  to  measure  the  carrier  recombination  times.  Furthermore,  the  dependence  of  the 
recombination  time  on  the  excitation  intensity  may  provide  information  about  nature  of  the 
recombination  processes. 

The  sample  for  studying  Sections  a  and  b  above  was  grown  by  MBE  on  a  semi- 
insulating  GaAs  substrate  at  the  temperature  of  600  C  in  collaboration  with  Naval  Research 
Labs.  The  epitaxial  layers  consist  of  20  periods,  each  of  which  is  composed  of  two  narrow 
asymmetric  coupled  GaAs  quantum  wells  with  the  designed  thicknesses  of  50  A  and  65A, 
coupled  by  SSA-Alo^Gay^As  barriers,  see  Fig.  1.  The  thicknesses  of  the  barriers  between 
the  adjacent  periods  are  150 A.  During  the  sample  growth  there  is  an  interruption  for  60 
seconds  at  every  interface.  Because  of  this  growth  interruption,  interface  islands  with  sizes 
larger  than  the  average  exciton  radius  are  formed,  allowing  excitons  being  spatially-localized 
within  these  islands  with  separate  optical  transition  energies  from  that  of  free-excitons  |1|.  As 
a  result,  in  each  designed  well  the  absorption  and/or  emission  peaks  are  separated  from  each 
other  corresponding  to  one  monolayer  thickness  (2.8  A)  difference. 

c.  Growth  of  a  new  structure 
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Previously,  we  proposed  to  use  semiconductor  multilayers  to  generate  surface-emitting 
second-harmonic  radiation  [11]  to  implement  transversely-pumped  counter-propagating  OPOs 
[11].  Recently,  we  designed  the  first  structure.  The  epitaxial  layers  consist  of  two  Bragg 
reflectors  and  alternating  layers  for  achieving  quasi-phase  matching. 

d.  Attempt  to  mode-lock  TiiSapphire  laser 

To  characterize  semiconductor  lasers  in  time-resolved  domain,  an  ultrafast  laser  source 
(i.e.  mode-locked  TiiSapphire  laser)  is  required  to  excite  the  carriers  to  the  high-energy  sub¬ 
bands.  The  relaxation  processes  can  be  then  probed  via  different  techniques. 

3.  Methodology 

a.  Photoluminescence  spectrum 

Our  asymmetric-coupled  quantum-well  structure  is  pumped  by  a  CW  Argon  laser  at  the 
wavelength  of  5145  A.  The  photoluminescence  was  collected  by  a  monochromator  via  a 
lens. 

b.  Photoluminescence  decay 

For  the  measurement  of  the  PL  decay,  we  used  a  mode-locked  Ar+  laser  as  our  excita¬ 
tion  pulse  with  the  pulse  duration  of  150  ps  and  output  wavelength  of  5145  A.  The  temporal 
traces  of  the  PL  signal  were  taken  via  a  streak  camera  w'ith  a  time  resolution  of  20  ps.  Fig. 
2  shows  our  schematic  set-up. 

c.  Design  of  a  new  structure 

Our  optimized  design  of  the  multilayer  structure  is  based  on  our  vigorous  consideration 
of  surface-emitting  frequency  doublers  [Fig.  3(a)]  and  transversely-pumped  counter- 
propagating  OPOs  and  OPAs  [Fig.  3(b)],  see  Refs.  [11,12]. 

d.  Attempt  to  mode-lock  Ti:Sapphire  laser 

The  schematic  set-up  for  Argon-laser-pumped  mode-locked  TiiSapphire  laser  and  the 
TiiSapphire  laser  cavity  are  shown  in  Fig.  4.  We  have  followed  the  manual  for  TiiSapphire 
provided  by  Clark-MXR  Inc.  The  mode  structure  and  stability  were  determined  by  eyes  after 
expanding  the  laser  beam.  The  output  laser  pulse  from  mode-locked  TiiSapphire  laser  can  be 
sent  to  Coumarin  460  for  generating  two-photon  fluorescence. 

4.  Results 
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a.  Saturation  of  photoluminescence  peak 

The  PL  spectra  for  several  pump  intensities  are  shown  in  Fig.  5.  At  laser  intensity  of 
9.7  mW/cm2  there  are  two  emission  peaks:  the  one  on  the  long  wavelength  side  (-7780  A) 
corresponds  to  the  emission  of  excitons  at  the  interface  islands  while  the  other  one  (-7773  A) 
corresponds  to  the  free  excitons.  When  we  change  the  laser  intensity  from  9.7  mW/cm2  to 
1.4  W/cm2  at  4  K,  we  can  see  that  the  emission  peak  for  the  localized  excitons  loses  its  rela¬ 
tive  strength. 

Due  to  growth  interruption,  a  single  PL  peak  breaks  into  two  because  of  the  formation 
of  interface  islands  with  the  size  larger  than  the  exciton  radius.  At  low  temperatures,  all  the 
earners  generated  by  the  pump  laser  will  be  eventually  relaxed  dow'n  to  the  lowest  energy 
levels  and  localized  in  the  islands,  resulting  in  large  carrier  densities.  If  the  total  area  of  the 
islands  is  small,  it  would  be  possible  to  completely  fill  exciton  states  in  these  islands  at  rela¬ 
tively  low  intensities,  which  manifests  as  the  saturation  of  the  PL  peaks.  This  type  of  the 
band-filling  effect  only  occurs  at  the  spatially-localized  islands.  The  laser  intensity  required 
to  almost  completely  fill  the  localized  exciton  states  is  more  than  two  orders  of  magnitude 
lower  than  that  obtained  before  1 13J. 


b.  Photoluminescence  decay 

We  have  made  the  time-resolved  PL  measurements  in  our  sample.  Fig.  6  shows  the 
typical  temporal  PL  traces  detected  at  the  center  wavelength  of  e^hj  emission  peak  (the 
excitonic  emission  peak)  as  a  result  of  the  carrier  recombination  at  the  interface  islands.  At 
the  low  excitation  intensity  (207  W/cm2),  the  PL  signal  at  the  e,hh,  (II)  emission  peak  has  a 
decay  time  of  about  269  ps.  As  the  intensity  increases,  the  decay  time  increases.  As  shown 
in  Fig.  6,  when  the  laser  intensity  is  414,  621,  and  828  W/cm2,  the  decay  time  is  about  326, 
537,  and  666  ps. 

In  the  quasi-CW  regime,  the  density  of  excitons  can  be  determined  as 


XT  *‘1J 

N„v  =  T  — 


.a 


"hC0|.tsCr 


(1) 


where  Ilaser  is  the  laser  intensity,  a  is  the  absorption  coefficient,  and  1icolascl.  is  the  energy  of  a 
single  photon.  The  intensity  required  to  completely  fill  the  e,hh,  exciton  states  is  about 
1.4W/cm2.  Assuming  a=  10000cm-1  at  the  pumping  wavelength  in  our  experiments,  the 
exciton  density  is  then  estimated  to  be  9.75x  lO^cm-2.  The  corresponding  area  density  is 
1.46  x  107cm  2.  This  carrier  density  is  more  than  two  orders  of  magnitude  lower  than  that  in 
Ref.  [13], 
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c.  Growth  of  a  multilayer  structure 

Based  on  Refs.  [11,12],  we  had  already  designed  an  optimized  structure  [see  Fig.  3(c)]. 
We  had  grown  this  structure  in  collaboration  with  Drs.  J.  L.  Loehr  and  J.  Ehret  at  Wright 
Labs.  We  will  test  the  performance  of  this  structure  as  an  efficient  frequency  doubler  and 
optical  parametric  oscillator  and  amplifier  at  Bowling  Green  State  University.  The  pump, 
input  and  output  wavelengths  are  designed  to  be  1.06  pm,  1.58  pm,  and  3.23  pm.  By  chang¬ 
ing  the  incident  angle,  one  can  tune  the  output  wavelengths  [12]. 

d.  Attempt  to  mode-lock  TiiSapphire  laser 

The  TiiSapphire  laser  is  pumped  by  a  re-furnished  Argon  laser  (Coherent  Innova  90), 
see  Fig.  4.  For  4-watt  pump  power  of  a  multi-line  Argon  laser,  conversion  efficiency  as  high 
as  20%  was  achieved  in  CW  regime.  We  had  tried  our  best  to  mode-lock  Ti:Sapphire  laser. 
We  observed  two-photon  fluorescence  in  Coumarin  460,  however,  was  not  able  to  stablize 
the  mode-locked  laser  output.  We  believe  that  mode  structure  and  pointing  stability  could  be 
the  problems  for  mode-locking  the  TiiSapphire  laser.  When  there  is  no  aperture  for  the 
Argon  laser,  high-order  modes  other  than  TEM(M)  exist  in  the  Argon  output  beam  profile.  In 
addition,  we  have  crudely  estimated  the  pointing  stability  as  -  lOOprad,  which  is  an  order  of 
magnitude  larger  than  that  required  for  stable  mode-locking. 

5.  Conclusion 

We  have  observed  saturation  of  photoluminescence  peak.  We  have  measured  time- 
resolved  photoluminescence  decay  in  growth-interrupted  asymmetric-coupled  quantum  wells. 
We  have  subsequently  determined  decay  times  and  characteristic  carrier  densities  for  the 
observed  photoluminescence  intensity  saturation.  We  have  grown  a  multilayer  structure  that 
can  be  used  to  implement  an  optical  parametric  oscillator  and  amplifier  and  frequency  dou¬ 
bler  in  a  novel  configuration.  Finally,  we  have  tried  to  mode-lock  TiiSapphire  laser  and  sum¬ 
marized  potential  problems  causing  unstable  mode-locked  output. 

Two  journal  papers  based  on  our  results  will  be  submitted  for  publication. 
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Abstract 


Laser  based  diagnostic  techniques  were  used  and  developed  in  the  study  of  combustion  and 
compressible  flows.  Laser  diagnostic  techniques  based  on  using  molecular  filters  to  discriminate 
the  Rayleigh  scattering  from  molecules  or  particles  in  a  flow  field  were  investigated.  The  first 
technique  termed  filtered  Rayleigh  scattering  was  used  to  measure  the  instantaneous  temperature 
field  in  hydrogen-air  and  methane-air  flames.  Uncertainties  in  these  measurements  were  explored 
and  found  to  be  from  4  to  7%  for  the  product  region  of  the  flame,  but  slightly  higher  (approximately 
16%)  in  the  reaction  region  of  the  flames  studied.  This  uncertainty  was  primarily  associated  with 
the  unknown  species  concentration  and  possible  solutions  to  this  problem  will  be  investigated.  A 
second  molecular  filter  based  technique  called  filter  planar  velocimetry  (FPV)  allows  the 
measurement  of  the  instantaneous  velocity  in  a  two  dimensional  plane.  FPV  was  used  to  investigate 
axisymmetric  and  elliptic  jets  in  a  M  =  2  cross  flow.  To  date  preliminary  average  measurements 
have  been  analyzed  and  are  presented  here.  Lastly,  flow  visualizations  were  taken  of  underexpanded 
square  jets  to  investigate  the  flow  structure  and  use  as  a  possible  mixing  enhancement  configuration. 
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LASER  BASED  DIAGNOSTIC  TECHNIQUES 
FOR  COMBUSTION  AND  COMPRESSIBLE  FLOWS 

Gregory  S.  Elliott 


I.  THE  USE  OF  FILTERED  RAYLEIGH  SCATTERING  FOR  PLANAR  TEMPERATURE 
MEASUREMENTS  IN  COMBUSTION 

I.A.  Introduction 

Techniques  employing  molecular  filters  to  modify  the  light  scattered  from  particles  or 
molecules  in  a  flow  offer  the  possibility  of  measuring  single  or  multiple  properties  simultaneously. 
The  molecular  filter  is  simply  a  cylindrical  optical  cell  containing  a  molecule  which  has  absorption 
transitions  within  the  frequency  of  the  laser  interrogating  the  flow  field.  The  molecular  filter  is 
placed  in  front  of  the  receiving  optics  to  modify  the  frequency  spectrum  of  the  scattering.  Miles  et 
al.[1992]  first  demonstrated  filtered  Rayleigh  scattering  (FRS)  employing  molecular  iodine  filters 
in  conjunction  with  injection  seeded,  frequency-doubled  Nd:YAG  laser  (A  =  532  nm).  With 
injection  seeding  the  linewidth  is  narrow  (100  MHz)  and  the  laser  frequency  can  be  tuned  to  match 
the  transitions  of  iodine.  In  addition  to  qualitative  flow  visualizations.  Miles  et  al.  [1992]  showed 
that  average  properties  of  the  flow  at  each  point  in  the  illuminated  plane  can  be  obtained  if  the 
scattered  signal  is  collected  from  molecules.  Recently  Schrami  et  al.  [1996]  demonstrated  that  FRS 
could  also  be  used  to  obtain  temperature  measurements  in  lightly  sooting  flames,  but  there  is  still 
a  need  for  more  comprehensive  treatment  of  the  uncertainties  of  the  measurement  technique. 

I.B.  Experimental  Arrangement 

Figure  1  gives  a  schematic  of  the  burner  and  optical  arrangement  for  the  preliminary 
measurements  given  here.  Three  different  burners  were  used  in  this  study.  At  Wright-Patterson  Air 
Force  Base  a  hydrogen-air  flame  was  studied  using  a  Hencken  burner  with  a  25  mm  square 
combustion  region.  The  ability  of  FRS  to  record  temperatures  near  the  burner  surface  were 
investigated  in  a  methane/air  flame  created  above  a  sintered  bronze  McKenna  burner  which  is  water 
cooled  to  283  K.  The  third  burner  is  copper  burner  with  an  array  of  64  1  mm  diameter  holes  within 
an  area  of  169  mm2. 

The  interrogation  laser  beam  was  formed  into  a  sheet  with  a  combination  of  cylindrical  and 
spherical  lenses.  Two  different  frequency  doubled  Nd:YAG  laser  were  used  in  the  present 
experiments  a  Spectra  Physics  GCR-150  (Henken  and  McKenen  burner  experiments  at  Wright 
Paterson)  with  approximately  400  mJ  per  pulse  and  a  Spectra  Physics  GCR-230  (copper  burner 
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experiment  at  Rutgers)  with  650  mJ  per  pulse.  Both  lasers  have  an  injection  seeder  to  provide  a 
narrow  line  width  (~  100  MHz)  which  can  be  tuned  through  the  absorption  lines  of  iodine  around 
532  nm.  The  Nd:YAG  lasers  used  in  these  experiments  have  a  pulse  width  of  approximately  10  ns 
with  a  repetition  rate  of  10  Hz.  Two  photodiodes  were  used  to  monitor  the  laser  energy  and 
frequency  fluctuations,  using  a  second  iodine  filter.  The  Rayleigh  scattering  signal  is  collected  using 
a  Princeton  Instruments  14-bit  intensified  CCD  camera.  The  images  are  stored  on  a  Pentium  100 
MHZ  personal  computer  which  provided  camera  control,  laser  synchronization,  and  frequency 
tuning  control  of  the  laser. 

A  major  component  of  the  FRS  system  is  the  iodine  filter.  The  iodine  filter  is  basically  a 
glass  cylinder  9  cm  in  diameter  and  24  cm  in  length  with  flat  optical  windows  on  both  ends.  Iodine 
vapor  is  formed  in  the  cell  by  inserting  a  small  amount  of  iodine  crystals  and  evacuating  the  cell. 
The  cell  temperature  (Tcell)  is  raised  above  the  ambient  temperature  with  electrical  heat  tape  so  that 
no  iodine  crystallizes  on  the  windows.  The  coldest  point  in  the  cell  is  set  in  the  side  arm  (TI2),  which 
is  housed  in  a  water  jacket  and  maintained  at  a  constant  temperature  by  a  circulation  water  bath.  The 
temperature  of  the  side  arm  controls  the  vapor  pressure  (number  density)  of  the  iodine  in  the 
absorption  cell.  The  Nd:  YAG  laser  has  a  narrow  line  width  and  atX  =  532  nm  it  can  be  tuned  across 
the  absorption  bands  of  iodine.  Figure  2  presents  the  absorption  spectra  with  the  two  optically  thick 
absorption  lines  which  were  used  in  the  present  experiments  at  wave  numbers  of  18788.44  and 
1 8789.28.  The  profile  was  taken  with  the  cell  operated  at  Tcen  =  358  K  and  T,2  =  3 1 8  K. 


I.C.  Theoretical  Description 

Rayleigh  scattering  has  been  used  to  investigate  combustion  and  flow  fields  for  many  years 
and  much  of  the  theoretical  analysis  is  well  known.  When  interrogating  a  flow  with  a  laser  and 
collecting  the  Rayleigh  scattering  signal  from  molecules,  there  are  many  important  parameters 
governing  the  scattered  intensity  (intensity  of  the  illuminating  light,  polarization,  frequency  of  the 
illuminated  light,  etc.).  In  the  current  use  of  the  FRS  technique  in  combustion  environments,  the 
intensity  and  spectral  profile  of  the  scattering  are  needed  to  deduce  the  temperature.  Since  the 
flames  studied  here  are  at  atmospheric  pressure  the  Rayleigh  scattering  can  be  in  the  kinetic  regime 
and  contain  Brillouin  scattering  effects.  Many  different  models  exist  for  calculating  the  Rayleigh 
scattering  spectral  profile  [Yip  and  Nelkin,  1964  and  Tenti  et  al.,1972]  and  have  been  confirmed  by 
experimental  measurements  [Lao  et  al.,  1976],  The  shape  of  the  scattered  spectrum  is  typically 
parameterized  by  the  dimensionless  frequency  X  and  the  y  parameter.  The  dimensionless  frequency 
is  nondimensionalized  by  the  thermal  broadening  and  is  given  by 


X 


^lv-v0l  M 
2 sin (0/2)  \j  2kT 
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where  X  is  the  wavelength  of  the  incident  light,  (v  -v0)  is  the  frequency  difference  from  line  center, 
0  is  the  angle  between  the  incident  and  scattered  wave  vectors,  k  is  the  Boltzmann’s  constant,  T  is 
the  temperature  of  the  gas,  and  M  is  the  molecular  mass.  The  y  parameter  (which  is  the  ratio  of  the 
collisional  frequency  to  the  acoustic  spatial  frequency)  describes  the  shape  of  the  Rayleigh  scattering 
spectrum  and  is  given  by 
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where  P  is  the  pressure,  and  p  is  the  viscosity.  For  y  of  the  order  of  unity,  the  kinetic  regime, 
Brillouin  components  become  important  and  kinetic  models  must  be  used.  For  y  «1  (low  pressures 
or  high  temperatures)  the  scattering  spectrum  is  Gaussian  and  the  Brillouin  components  can  be 
neglected.  The  total  spectral  intensity  is  given  by  the  intensity  of  the  scattering  of  the  ith  gas  species 
weighted  with  its  mole  fraction  x;  and  Rayleigh  cross  section  oRi  and  is  given  by 

I(v)  =CI0N^.  Xj oRi ^(TjPjMj.O.v)  (3) 

where  C  is  the  optical  calibration  constant,  I0  is  the  incident  laser  light  intensity,  N  is  the  total 
number  density,  and  r,  is  the  scattering  distribution  of  the  ith  gas  species  determined  by  the  kinetic 
model  (ie.  S6  model  used  by  Tenti  et  al.,  1972). 

The  intensity  collected  by  the  camera  is  a  convolution  of  the  total  spectral  intensity  [I(v  )]and 
the  transmission  profile  [A(v)]  as  shown  schematically  in  Figure  3.  As  illustrated  in  Figure  3  the 
scattering  from  particles  and  surfaces  has  a  narrow  linewidth  and  is  attenuated  by  the  iodine  filter. 
Since  the  Rayleigh  scattering  from  molecules  is  thermally  broadened  much  of  the  scattered  intensity 
passes  outside  the  absorption  profile  (as  illustrated  by  the  shaded  region  of  Figure  3).  This  is  a 
significant  characteristic  of  FRS  since  molecular  Rayleigh  scattering  is  weak  relative  to  surface  and 
particle  scattering. 

In  order  to  eliminate  the  dependence  on  the  optical  calibration  constant  and  on  the  incident 
laser  irradiance  distribution,  images  with  the  flame  on  were  normalized  by  scattering  collected  from 
the  air  at  ambient  conditions.  For  the  present  experiments  only  premixed  flames  are  considered  and 
the  dominate  species  is  nitrogen,  and  therefore  the  scattering  will  be  assumed  to  be  from  a  single 
species.  Also  the  velocity  component  with  the  current  arrangement  leads  to  little  change  in 
frequency  due  to  Doppler  shift  and  so  it  will  be  ignored  in  the  present  analysis.  With  these 
assumptions  and  since  the  flames  are  all  at  atmospheric  pressure  the  intensity  is  a  function  only  of 
temperature  for  a  given  optical  arrangement  and  the  ratio  is  given  by 
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The  local  gas  temperature  influences  the  collected  intensity  through  the  gas  number  density  (N)  and 
the  thermal  broadening  which  increases  the  portion  of  the  scattering  which  is  not  absorbed  by  the 
filter.  Thus  if  the  normalized  intensity  ratio  is  known  [S(v)/Sair(v)]  the  temperature  can  be 
determined. 

I.D.  Hydrogen-Air  Flame  Results 

Figure  4  gives  the  two  dimensional  temperature  field  (x-y  view)  for  a  hydrogen-air  flame  ($ 
=  0.25)  2  cm  above  the  Henken  burner  with  a  plot  of  the  temperature  profile  in  the  center  of  the 
image  given  below  the  appropriate  image.  It  should  be  noted  that  no  particular  care  was  taken  to 
physically  eliminate  particles  in  the  flame  or  in  the  surroundings.  Scattering  from  particles  generally 
will  dominate  the  Rayleigh  scattering,  however,  in  the  present  experiment  the  particle  scattering  is 
strongly  absorbed  since  it  is  not  thermally  broadened.  The  adiabatic  flame  temperature  in  this  case 
is  1065  K.  The  first  case  (Fig.  4a)  shows  a  single  shot  instantaneous  temperature  field  with  the 
average  temperature  away  from  the  edges  of  the  flame  being  1 062  K.  The  instantaneous  temperature 
profile  has  a  high  degree  of  fluctuation  in  what  should  be  a  relatively  flat  profile  as  shown  by  CARS 
measurements.  Most  likely  this  fluctuation  is  due  to  the  photon  statistics.  Thus  the  signal  to  noise 
ratio  can  be  improved  by  either  increasing  the  number  of  photons  collected  (more  laser  energy  per 
pulse),  low  pass  filtering,  or  binning  multiple  pixels  together.  Figure  4b  shows  the  effect  of  binning 
3  adjacent  pixels  together  (Fig.  4c)  and  as  expected  the  random  temperature  fluctuations  decrease, 
but  at  the  cost  of  decreased  spatial  resolution  (from  0.1  mm2  to  1  mnf  ).  Figure  4c  is  the  two- 
dimensional  temperature  field  made  by  averaging  40  instantaneous  images  together.  As  expected 
this  shows  the  least  amount  of  temperature  fluctuation,  but  of  course  the  instantaneous  information 
is  lost. 

Figure  5a  and  5b  give  the  average  temperature  and  standard  deviation  of  the  temperature  in 
the  flat  portion  of  the  hydrogen-air  flame  for  various  equivalence  ratios.  The  temperatures  measured 
with  FRS  are  in  excellent  agreement  with  the  adiabatic  flame  temperature.  The  maximum  deviation 
is  at  the  highest  temperatures  with  the  FRS  measurements  always  slightly  lower  as  expected.  The 
standard  deviation  of  the  temperature  across  the  flat  portion  of  the  flame  is  given  in  Figure  5b  for 
the  instantaneous,  bin  by  3,  and  average  temperature  fields  for  various  equivalence  ratios.  As 
expected  binning  the  pixels  of  the  instantaneous  image  by  three  increases  the  signal  to  noise  ratio 
by  approximately  three  for  all  cases  which  would  be  the  case  if  the  fluctuations  were  a  result  of 
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photon  statistics.  The  percent  of  temperature  fluctuation  for  the  average,  instantaneous,  and 
instantaneous  bin  by  three  temperature  fields  were  found  to  be  approximately  2%,  8%,  3% 
respectively. 

I.E.  Methane-Air  Flame  Results 

One  advantage  of  FRS  is  that  it  can  be  used  to  make  measurements  close  to  surfaces  without 
being  affected  strongly  by  surface  scattering,  since  surface  scattering  like  particle  scattering  is  at  the 
laser  frequency  and  is  not  Doppler  shifted  or  broadened  outside  of  the  absorption  profile.  Figure  6a 
and  b  are  the  respective  average  two-dimensional  temperature  fields  of  a  methane-air  flame  lifted 
from  the  surface  of  a  McKenna  burner.  The  burner  is  water  cooled  and  has  a  surface  temperature 
of  approximately  283  K  with  the  edge  located  at  x  =  0  slightly  off  the  left  side  of  the  image.  For  the 
lifted  flame  (Fig.  6a)  the  temperature  before  the  flame  front  is  slightly  low  (262  K)  which  is  due 
primarily  to  the  fact  that  the  Rayleigh  scattering  cross  section  of  the  mixture  is  slightly  higher  than 
that  of  pure  nitrogen.  For  the  attached  flame  (Fig.  6b)  the  temperature  gradient  can  be  seen  on  the 
left  side  of  the  image  due  to  the  greater  heat  transfer  at  the  burner  edge.  The  adiabatic  flame 
temperature  for  the  lifted  and  attached  flames  was  1623  K  and  1690  K,  respectively,  while  the 
measured  temperatures  were  1522  K  and  1537  K  respectively.  Note  that  the  measured  values  are 
expected  to  be  lower  due  to  the  heat  transfer  to  the  much  cooler  burner  surface.  At  the  current 
resolution,  measurements  were  made  within  300  pm  of  the  burner  surface,  limited  mostly  by  the 
focal  length  of  the  camera  lens. 

One  of  the  greatest  advantages  of  FRS  for  two  dimensional  temperature  measurements  is  in 
its  ability  to  capture  instantaneous  fluctuations  of  unsteady  flames.  Figure  7  is  the  average  (Figure 
7a)  and  instantaneous  temperature  fields  (Figs.  7b-h)  of  the  premixed  methane-nitrogen-oxygen 
flame  (<3>=  1.21,  adiabatic  flame  temperature  of  251  OK).  Using  FRS  the  average  flame  temperature 
was  measured  at  2265  K.  Again  this  lower  temperature  is  due  in  part  to  unsteadiness  in  conjunction 
with  the  averaging  process.  Almost  all  the  instantaneous  images  show  buoyantly  driven  vortices 
rolling  up  on  the  edge  of  the  flame  and  the  associated  temperature  variation  within  them. 

I.F.  Uncertainty  Analysis 

A  simple  uncertainty  analysis  has  been  performed  to  estimate  the  uncertainty  associated  with 
the  temperature  field  measurements  using  FRS.  The  uncertainties  for  the  hydrogen-air  flame  are 
approximately  8.5%  on  average  which  seems  high  when  compared  with  the  agreement  obtained 
between  the  FRS  measured  temperatures  and  the  adiabatic  flame  temperature.  This  is  due  mainly 
to  the  fact  that  for  these  flames  the  uncertainty  due  to  the  molecular  weight  and  Rayleigh  scattering 
cross  section  are  added  when  in  actuality  they  cancel  each  other  out  (ie.  the  molecular  weight  is 
lower  than  nitrogen  which  results  in  more  of  the  scattering  broadened  outside  of  the  absorption  filter 
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and  the  Rayleigh  scattering  cross  section  is  lower  than  nitrogen  resulting  in  less  signal).  Taking  this 
fact  into  consideration  reduces  the  uncertainty  to  approximately  5%  which  is  more  in  line  with  the 
measurements. 

The  highest  uncertainties  (above  10%)  were  found  to  be  within  the  reactant  region  of  the 
methane  flame  particularly  for  the  flame  with  the  equivalence  ratio  of  1 .2.  The  predominance  of 
methane  with  a  Rayleigh  scattering  cross  section  twice  that  of  nitrogen  causes  the  temperatures  to 
be  under  estimated  as  is  seen  in  the  FRS  measurements  presented  in  Figure  6  and  7.  The  product 
region  of  the  flame,  however,  shows  a  lower  uncertainty  of  approximately  4%  for  these  methane 
flames  since  the  molecular  weight  and  Rayleigh  scattering  cross  section  is  much  closer  to  nitrogen. 
The  uncertainties  for  the  instantaneous  images  are  higher  (due  to  higher  noise  in  the  recorded 
intensity)  being  approximately  20%  for  the  reactant  region  and  7%  for  the  product  region  of  the 
flame.  It  should  also  be  noted  here  that  the  values  for  the  uncertainty  of  the  independent  variables 
were  generally  over  estimated  resulting  in  very  conservative  estimates  of  the  total  uncertainty  in  the 
temperature  measured  with  FRS. 


II.  FILTER  PLANAR  VELOCIMETRY  MEASUREMENTS  OF  A  JET  IN  A  M  =  2  CROSS 
FLOW 

II  .A.  Introduction 

Although  filtered  Rayleigh  scattering  relies  on  scattering  from  molecules  for  the  temperature 
measurements  described  above,  other  techniques  use  scattering  off  of  particles  natural  present  or 
seeded  into  the  flow.  These  molecular  filter  Based  velocity  techniques  have  been  developed  to 
measure  the  average  velocity  (Doppler  Global  Velocimetry,  DGV,  [Meyers  and  Komine,  1991])  or 
instantaneous  velocity  (Filtered  Planar  Velocimetry,  FPV,  [Elliott  et  al.,  1994]).  FPV  utilizes  two 
cameras.  One  camera  has  a  pressure  broadened  iodine  filter  in  front  of  it  which  produces  gradual 
slopes  (Figure  8)  in  the  absorption  profile  instead  of  the  sharp  temperature  broadened  filter  used  by 
Miles  et  al.  [1992],  A  second  camera  with  no  filter  (similar  to  DGV)  is  used  to  normalize  the  image 
from  the  filtered  camera,  accounting  for  variations  in  laser  intensity,  and  particle  size  and  density 
in  the  measurements.  When  the  laser  frequency  is  tuned  to  the  edge  of  the  sloping  region,  the 
Doppler  shift  will  move  the  scattered  intensity  into  the  sloping  region  of  the  absorption  profile 
resulting  in  a  deviation  in  the  transmission  ratio  (Figure  8).  The  Doppler  shift  is  given  by 

(5) 

where  L  and  k  0  are  the  observed  and  incident  unit  light  wave  vectors,  respectively,  and  V  is  the  flow 
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velocity  vector.  Thus  when  the  transmission  ratio  is  calculated  from  the  two  cameras  the  frequency 
shift  can  be  obtained  and  the  velocity  is  calculated  from  Equation  6.  Instantaneous  velocities  have 
been  measured  in  compressible  mixing  layers  [Elliott  et  ah,  1994]  and  multiple  velocity  components 
in  boundary  layers  [Amette  et  al.,  1995]  using  FPV. 

II.B.  Experimental  Arrangement 

The  flow  field  studied  using  FPV  was  a  jet  in  a  Mach  2  cross  flow  which  utilized  the 
supersonic  research  facility  at  Wright-Paterson  Air  Force  Base.  Details  of  this  facility  are  presented 
elsewhere  [Gruber  et  al.  1996].  Figure  9  gives  a  schematic  of  the  tunnel,  laser,  and  camera  geomtry. 
The  test  section  for  these  experiments  has  a  constant  area  test  section  with  a  cross  section  of  13 1mm 
by  152  mm.  The  tunnel  can  be  run  continuously  with  a  stagnation  temperature  and  pressure  of  300 
K  and  317  kPa  respectively  resulting  in  an  average  free  stream  velocity  of  5 1 6  m/s.  Two  different 
injector  geometries,  circular  (6.35  mm  diameter)  and  elliptic  (6.2  mm  major  axis  and  1.63  minor 
axis,  effective  diameter  of  6.35  mm)  were  incorporated  into  removable  test  inserts  housed  within  the 
bottom  wall  of  the  test  section.  The  exit  pressures  of  both  jets  were  regulated  to  476  kPa  resulting 
in  a  jet  to  free  stream  momentum  flux  ratio  of  2.93.  Particles  were  seeded  into  the  freestream  by  the 
combustion  of  silane  which  produces  silicone  dioxide  particles  in  the  range  of  0.2  pm  diameter. 

The  laser  and  camera  system  used  here  was  the  same  as  in  the  FRS  experiments  with  the 
exception  of  an  additional  camera  which  recorded  the  normalized  image  for  each  instantaneous 
filtered  image.  Another  difference  between  the  FRS  and  FPV  system  is  in  the  absorption  properties 
of  the  iodine  filters  used  with  each  system.  Figure  10  shows  the  difference  between  the  two  filter 
profiles  used  in  this  study.  The  filter  profile  used  in  the  FRS  measurements  was  dominated  by 
thermal  broadening  resulting  in  relatively  sharp  slopes  on  either  side  of  the  profile.  The  absorption 
profile  used  for  FPV  has  a  much  more  gradual  profile  which  is  made  by  introducing  a  nonabsorbing 
gas  (nitrogen  with  a  partial  pressure  of  1 5  torr)  into  the  iodine  filter.  This  results  in  the  absorption 
profile  being  dominated  by  pressure  broadening  effects  which  can  be  used  to  adjust  the  slope  for  the 
expected  frequency  range  encountered  in  the  flow  studied. 

In  the  past  when  making  velocity  measurements  using  molecular  filter  based  techniques  a 
problem  is  encountered  in  measuring  a  velocity  component  which  is  “natural”  to  the  flow  being 
studied  is  difficult  without  using  more  than  one  camera  pair  or  viewing  the  flow  at  oblique  angles. 
To  circumvent  this  problem  we  chose  to  use  the  laser  optical  arrangement  of  Figure  9  and  reflect  the 
laser  sheet  back  on  itself.  From  Equation  6  it  can  be  shown  that  for  the  present  arrangement  this 
results  in  a  sensitivity  to  the  velocity  vector  essentially  in  the  streamwise  direction  and  instantaneous 
streamwise  velocities  can  be  obtained.  In  order  to  find  information  about  the  spanwise  velocity  a 
single  laser  pass  was  used  resulting  in  a  velocity  vector  40  degrees  from  the  streamwise  direction. 
Although  the  spanwise  velocity  component  can  not  be  decoupled  from  the  streamwise  velocity 


17-9 


component  for  the  instantaneous  measurements,  for  the  average  velocity  field  both  optical 
arrangements  can  be  combined  to  calculate  the  spanwise  velocity  component. 

II.C.  Preliminary  results 

Measurements  were  taken  at  several  streamwise  locations  for  the  circular  and  elliptic  jet 
injection  geometries  both  with  and  without  the  laser  sheet  reflecting  back  on  itself.  At  the  present 
time  only  the  average  velocity  images  have  been  reduced.  Figure  1 1  gives  of  the  unfiltered  image 
and  streamwise  velocity  image  for  the  circular  jet,  and  the  spanwise  velocity  image  for  the  elliptic 
jet.  For  these  preliminary  results  the  ffeestream  velocity  was  used  to  calibrate  the  measurements 
which  will  not  be  necessary  when  the  instantaneous  images  are  fully  analyzed.  As  expected  the 
streamwise  velocity  image  shows  the  velocity  decrease  after  the  bow  shock  consistent  with  the  shock 
angle  which  the  flow  has  passed  through.  The  streamwise  velocity  decreases  further  as  the  jet  core 
and  boundary  layer  are  approached.  It  should  be  kept  in  mind,  however,  that  the  jet  was  not  seeded 
and  therefore  the  measurements  in  the  core  region  are  biased  by  the  free  stream  velocity  resulting 
in  a  higher  value  than  is  really  the  case.  The  spanwise  velocity  image  shows  essentially  zero 
spanwise  velocity  outside  the  bow  shock  and  in  the  core  of  the  jet.  There  are  however  positive  and 
negative  spanwise  velocity  components  inside  the  bow  shock  to  the  left  and  right  of  the  jet 
respectively.  This  is  most  likely  due  to  the  streamwise  vortices  which  are  formed  around  the  jet. 
It  should  be  emphasized  that  these  results  are  very  preliminary  and  further  conclusions  will  be  drawn 
as  the  instantaneous  images  are  calculated. 


III.  FLOW  VISUALIZATIONS  OF  AN  UNDEREXPANDED  SQUARE  JET 
III.A.  Introduction 

In  recent  years  much  effort  has  been  devoted  into  investigating  non-circular  (ie.  square, 
rectangular,  elliptical)  jets  and  the  possibilities  of  their  use  to  enhance  mixing  particularly  in  the 
compressible  flow  regime  [Grinstein,  1996  and  Glawe,  1995],  These  non-circular  geometries  find 
applications  in  thrust  vectoring,  enhancing  entrainment  for  combustion,  and  heat  signature  reduction. 
Square  jets  have  shown  45  degrees  of  axis  rotation  in  the  near  field  caused  by  the  vortex  dynamics 
in  the  comers  of  the  jet.  Although  these  observations  have  been  made  there  is  still  a  need  to  better 
understand  the  fluid  mechanisms  which  govern  these  observations.  As  a  first  step  toward  this  goal, 
flow  visualizations  and  FPV  images  have  been  taken  in  an  under  expanded  square  jet.  Preliminary 
images  are  given  here  with  the  majority  of  the  data  still  to  be  processed. 
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III.  B.  Experimental  Arrangement 

The  laser  and  camera  system  were  the  same  as  described  previously  for  the  FRS  and  FPV 
experiments,  allowing  instantaneous  (pulse  width  of  10  ns)  images  to  be  taken.  The  square  jet 
studied  has  a  25.4mm  by  24.5mm  cross  section  and  can  be  operated  continuously  with  the  supply 
air  at  the  Wright-Patterson  Air  Force  Base  supersonic  research  facility.  Stagnation  pressures  from 
240  to  517  kPa  were  studied  which  resulted  in  an  underexpanded  jet  for  all  conditions.  Two 
different  laser  sheet  orientations  were  investigated  one  aligned  parallel  to  the  jet  axis  (streamwise 
view)  and  one  with  the  sheet  aligned  perpendicular  to  the  jet  axis  (spanwise  view).  The  scattering 
was  collected  from  ice  particles  formed  from  the  moisture  in  the  supply  air  as  the  temperature 
decreases  through  the  nozzle  (marking  the  core  of  the  jet),  and  as  the  moist  ambient  air  mixes  with 
the  colder  supersonic  air  (marking  the  mixing  region  of  the  jet).  These  ice  particles  have  been  used 
by  other  investigators  to  study  various  flow  fields  using  similar  techniques  [Elliott  et  al.,  1992], 

III.  C.  Preliminary  Results  and  Discussion 

Average  streamwise  images  are  given  in  Figure  12  for  the  square  jet  operated  at  P0  =  240  to 
517  kPa.  Each  average  image  is  calculated  from  120  instantaneous  images  and  normalized  to 
remove  the  Gaussian  intensity  distribution  in  the  laser  sheet.  Oblique  shock  and  expansion  waves 
are  clearly  visible  above  a  stagnation  pressure  of  3 10  kPa.  What  appears  to  be  a  Mach  disk  occurs 
for  stagnation  pressures  above  380  kPa  at  an  x/s  *  1.5  (where  s  is  the  height  of  the  jet). 
Instantaneous  streamwise  images  are  given  in  Figure  13.  For  P0  =  240  and  345  kPa  large  scale 
structures  are  apparent  with  alternate  spacing  above  and  below  the  jet  centerline  similar  to  what  is 
seen  in  axisymmetric  jets  when  a  helical  mode  is  present.  Also  the  instantaneous  shock/expansion 
wave  pattern  is  much  different  than  the  average  images  for  the  P0  =  345  kPa  case.  Spanwise  images 
were  taken  for  each  operating  pressure  at  various  downstream  locations.  Average  spanwise  images 
for  the  square  jet  operating  at  P0  =  276  and  517  kPa  are  given  in  Figure  14.  For  the  lower  stagnation 
pressure  case  evidence  of  streamwise  vorticity  in  the  comers  of  the  jet  is  visible,  but  there  appears 
to  be  no  axis  rotation  as  the  shear  layers  on  the  sides  of  the  jet  develop  down  stream.  For  the  higher 
stagnation  pressure  case  (P0  =  517  kPa),  however,  axis  rotation  of  45  degrees  is  observed  as  the 
complex  shock  and  expansion  wave  patterns  develop  downstream. 
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Figure  1.  Schematic  of  laser  and  optical  arrangement  for  FRS  temperature  measurements. 


Wavenumber  (c m  ’) 


Figure  2.  Absorption  lines  of  iodine  in  the  frequency  tuning  range  of  the  Nd:YAG  laser. 
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Figure  3.  Schematic  of  absorption  profile  convoluted  with  molecular  Rayleigh  scattering. 


Figure  4.  Instantaneous  (a)  bin  by  3  (b)  and  average  (c)  images  and  temperature 
profiles  taken  using  FRS  above  a  hydrogen-air  flame. 
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Figure  5.  Average  temperature  (a)  and  standard  deviation  (b)  of  the  temperatures  for  a  hydrogen 
air  flame  operated  at  various  equivalence  ratios. 
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Figure  6.  Average  two-dimensional  temperature  fields  of  a  methane-air  flame  lifted 
(a)  and  attached  (b)  to  the  burner  surface. 
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Figure  9.  Plan  view  of  the  tunnel  and  optical  arrangement  for  FPV  measurements  of  a  jet  in  a 
supersonic  cross  flow. 
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Figure  10.  Iodine  absorption  filter  profiles  with  and  without  pressure  broadening  from  nitrogen 
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Figure  11.  Reference  (a),  streamwise  velocity  (b),  and  spanwise  velocity  (c)  average  images  of  a  jet 
in  a  Mach  2  cross  flow  taken  using  FPV. 
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Figure  12.  Average  streamwise  flow  visualizations  of  underexpanded  square  jets  operated  at 
stagnation  pressures  (kPa)  of  240  (a),  276  (b),  310  (c),  345  (d),  380  kPa  (e),  414  (f),  450  (g), 
and  517  (h). 
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Figure  13.  Instantaneous  streamwise  flow  visualizations  of  underexpanded  square  jets  operated 
at  stagnation  pressures  (kPa)of276  (a),  345(b),  414(c),  and  517(d). 
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Figure  14.  Average  sspanwise  flow  visualizations  of  underexpanded  square  jets  operated  at 
stagnation  pressures  (kPa)  of  276  (a-d),  and  517  (e-h). 
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Abstract 


In  this  report,  a  vertically-connected  3D  T/R  module  for  a  conformal  phased  array  antenna 
systems  is  investigated.  Active  and  passive  elements  and  circuits  suitable  for  the  proposed  3/D 
modules  are  analyzed.  The  module  involves  the  use  of  thermally  shunted  HBT  for  power 
amplifier,  various  distributed  circuit  elements  realized  by  coplanar  waveguides  (CPW),  and  vertical 
posts  interconnecting  the  various  circuit  layers.  Coplanar  waveguides  completely  immersed  in  a 
dielectric  medium  and  surrounded  by  an  upper  and  a  lower  ground  plane  is  analyzed  using 
conformal  mapping.  It  is  shown  that  if  the  separation  distance  between  the  side  ground  planes  of 
the  CPW  is  small  compared  to  the  vertical  distances  to  the  upper  and  lower  ground  planes,  the 
impedance  of  the  line  becomes  independent  of  the  dielectric  layer  thickness.  Sycar  is  chosen  as  the 
dielectric  material.  Uniformity  and  thickness  control  during  the  processing  of  sycar  will  lead  to 
reliable  low  loss  vertical  interconnects.  The  goal  of  the  project  is  to  implement  a  3D  T/R  module 
and  demonstrate  the  feasibility  of  3D  vertical  interconnects  by  incorporating  these  modules  in  a  2x2 
phased  array  antenna  system. 
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VERTICAL  3-D  INTERCONNECTS  FOR 
MULTICHIP  MODULES 


Altan  M.  Ferendeci 

Introduction 

At  present,  solid  state  power  sources  can  not  replace  the  high  power  slow-wave  microwave 
tubes.  But  by  incorporating  novel  power  combining  techniques,  the  overall  combined  power  of  the 
solid  state  sources  may  approach  that  of  a  single  tube  output.  This  is  especially  true  for  phased 
array  antenna  systems  where  each  radiator  has  its  own  power  amplifier  and  the  total  radiated  power 
is  the  combined  power  of  all  the  radiators.  In  a  phased  array  antenna  system,  the  transmit/receive 
(T/R)  modules  are  placed  on  a  line  (linear  array),  on  a  plane  (planar  array)  or  on  a  circular  area 
(circular  array).  To  eliminate  the  main  side  lobes,  the  separation  distance  between  the  antennas 
should  be  less  than  half  the  free  space  wavelength  of  the  radiated  signal. 

At  microwave  and  millimeter  wave  frequencies,  planar  antennas  are  replacing  the 
conventional  antenna  structures.  The  dimensions  of  the  patch  antennas  are  generally  less  than  the 
required  free  space  half-wavelength.  They  can  be  configured  to  be  used  in  any  array  configuration. 
Unfortunately,  if  a  compact  module  including  all  the  electronics  is  to  be  integrated  with  the  patch 
antenna,  it  is  impossible  to  place  all  the  individual  circuit  modules  on  the  same  substrate  and  at  the 
same  time  satisfy  the  antenna  spacing  requirements.  This  becomes  more  important  especially  at 
millimeter  wavelengths.  Even  though  most  of  the  system  components  are  in  the  form  of  monolithic 
integrated  circuits,  they  still  occupy  a  large  surface  area  of  a  substrate.  Even  if  the  complete  circuit 
can  be  laid  out  on  the  same  GaAs  or  InP  substrate,  it  will  too  expensive  to  use  this  type  of  large 
area  substrates  behind  each  of  the  radiator  .  The  complete  transmit/receive  (T/R)  module  also 
contains  a  high  power  amplifier.  Unless  precautions  are  taken  to  remove  the  generated  heat, 
placing  this  circuit  on  the  same  substrate  with  other  temperature  sensitive  circuits  such  as  low  noise 
preamplifiers  may  not  be  desirable.  Even  if  lumped  elements  are  replaced  by  distributed  elements  at 
higher  frequencies,  the  overall  T/R  module  may  still  require  a  large  substrate  area. 

At  present,  all  circuit  modules  are  placed  an  enclosed  casing.  The  power  is  coupled  to  the 
antenna  through  one  end  of  the  casing.  Unfortunately,  this  increases  the  overall  depth  of  the  T/R 
module  relative  to  the  lateral  dimensions  of  the  antenna.  These  modular  units  have  the  distinct 
disadvantage  of  having  a  surface  area  to  depth  ratio  very  small.  Even  with  these  limitations,  they 
can  still  be  utilized  in  a  linear  or  a  planar  array.  They  can  not  be  distributed  over  a  general  curved 
surface.  For  a  module  to  be  universally  conformal  to  any  surface,  the  depth  of  the  module  should 
be  negligibly  small  compared  to  the  maximum  lateral  dimensions  of  the  modular  unit. 
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It  is  possible  to  place  the  circuit  modules  vertically  one  over  the  other  so  that  the  final  T/R 
module  will  occupy  less  space.  3D  module  configuration  dictates  the  development  of  a  new 
packaging  technology  so  that  various  circuit  modules  at  various  levels  can  be  vertically 
interconnected  with  each  other  with  little  or  no  loss  in  signal  amplitude,  with  no  cross  coupling 
between  the  elements  and  with  manageable  processing  steps. 

There  is  already  considerable  research  and  development  effort  being  extended  to 
incorporate  vertical  interconnects  in  the  implementation  of  monolithic  microwave  integrated  circuits 
(MMIC)[1,2].  There  are  two  major  concentrated  research  areas  that  utilize  3D  vertical 
interconnects.  First  area  is  the  use  of  vertical  interconnects  for  compacting  the  size  of  an  integrated 

circuit  module  [3],  In  this  scheme,  thin  multiple  dielectric  layers  (2.5  \xm)  are  used  for  connecting 

the  various  elements  of  the  circuit  together.  The  second  major  concentrated  effort  is  to  stack 
various  circuit  modules  of  the  overall  T/R  module  one  over  the  other  using  vertical  interconnects. 
Actually  both  of  these  concepts  complement  each  other  since  the  final  T/R  module  may  contain 
various  vertically  interconnected  MMICs. 

In  either  case,  coupling  of  microwave  or  millimeter  wave  signals  between  the  layers 
become  a  major  area  of  research.  Already,  there  is  considerable  research  work  going  on  related  to 
the  coupling  mechanisms  between  transmission  lines  of  the  various  layers  [4].  Novel  transmission 
lines  are  also  proposed  and  analyzed  [5-9]. 

In  this  report,  a  complete  3D  vertically  interconnected  3D  T/R  module  for  a  conformal 
phased  array  antenna  system  is  proposed.  Details  of  the  proposed  configuration  including  an 
integral  patch  antennas  will  be  given.  The  major  components  of  the  T/R  module  and  the  specific 
areas  that  have  to  be  developed  will  be  discussed. 

Phased  Array  Antenna  Module 

Typical  front  end  of  a  T/R  module  for  a  radar  system  is  shown  Figure  1.  Figure  2  shows  a 
2x2  modular  phased  array  antenna  system.  The  control  and  processing  electronics  are  not  shown  in 
these  figures.  The  lowest  substrate  layer  contains  the  power  amplifier.  Heat  can  be  easily 
transferred  from  the  substrate  by  placing  the  overall  module  on  a  highly  heat  conducting  metallic 
surface. 

Various  layers  of  the  T/R  module  are  processed  by  deposition  of  multiple  dielectric 
materials  of  known  thickness  followed  by  metal  depositions  immersed  between  the  dielectric 
layers.  As  the  module  is  processed,  necessary  interconnecting  posts  (or  vias)  are  electro-deposited 
and  the  circuit  for  that  layer  is  finally  processed. 
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The  upper  layer  of  the  module  has  a  patch  antenna  which  could  be  either  in  rectangular, 
circular  or  ring  shape  [10].  The  thickness  of  the  upper  dielectric  layer  below  the  patch  can  be 
adjusted  to  provide  the  maximum  radiation  condition  for  a  specific  antenna  [11]. 

Typical  dimensions  for  patch  antennas  operating  at  10  GHz  and  with  a  dielectric  material  of 

£=2.5  are  given  in  Table  I.  The  overall  dimensions  of  these  patches  are  smaller  than  X/2=15  mm 

and  thus  can  be  used  in  phased  array  antenna  systems  since  they  easily  satisfy  the  phased  array 
antenna  spacing  requirements . 
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Figure  1.  Major  components  of  a  TfR  module. 


Figure  3  shows  possible  distribution  of  vertically  interconnected  3D  T/R  modules  over 
various  surface  configurations.  These  surface  can  actually  be  of  any  shape  provided  the  radius  of 
curvature  of  the  surface  is  not  too  small  compared  the  lateral  dimensions  of  the  T/R  module.  Since 
the  overall  thickness  of  the  proposed  vertically  interconnected  3D  T/R  module  is  expected  to  be  a 
few  millimeters,  they  can  easily  be  distributed  over  any  conformal  surface  topology.  These 
properties  may  lead  to  vast  number  of  applications  in  communication  and  radar  systems. 
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Table  I.  Dimensions  of  a  patch  antennas 
(f=10GHz,  X/2  =  15mm,  e=2.5) 
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Figure  2.  4  element  phased  array  antenna  system  with  3D  T/R  modules. 

Figure  4  shows  the  essential  components  of  a  typical  representative  vertically 
interconnected  3D  T/R  module.  A  substrate  (preferably  a  Si-Silicon  or  Alumina)  is  the  base  of  the 
building  block.  Since  the  substrate  can  be  attached  to  another  heat  conducting  material  such  as  a 
metal,  the  power  amplifier  circuitry  is  processed  on  this  substrate.  The  power  amplifier  chip  is 
attached  to  the  Si-substrate  by  micromachining  an  opening  on  the  Silicon  surface  and  flip  bonding 
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of  the  chip  into  the  opening  [12,13].  Since  the  dielectric  thickness  of  the  vertical  layers  are 
relatively  thin  and  their  thickness  may  be  comparable  to  the  overall  height  of  a  thermally  shunted 
HBT,  it  is  recommended  that  the  micromachining  depth  should  be  as  deep  as  the  transistor  height 
so  that  the  ensuing  dielectric  layers  can  be  deposited  uniformly  over  the  substrate  material. 

Planar 


Figure  3.  Distribution  of  patch  antennas  over  a  line,  plane  or  a  cylinder. 
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Figure  4.  Representative  components  of  a  3D  module. 
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There  are  four  major  areas  that  have  to  be  developed  in  implementing  a  vertically 
interconnected  3D  T/R  module.  These  are: 

a)  Power  amplifier 

b)  Transmission  lines  and  passive  circuit  components 

c)  Deposition  and  processing  of  dielectric  layers 

d)  Low  loss  vertical  interconnects. 

Power  Amplifier. 

There  are  many  possibilities  for  the  implementation  of  the  power  amplifier.  The  complete 
power  amplifier  can  be  processed  as  a  monolithic  integrated  circuit  using  either  HBTs  or  FETs, 
Since  there  is  a  very  successful  development  program  on  thermally  shunted  HBT  power  devices 
(Figure  5)  at  the  Devices  Branch  of  the  Electronics  Division  of  the  Avionics  Laboratory  (AADD)  of 
the  Wright  Laboratory  [14],  it  will  be  an  excellent  opportunity  to  incorporate  these  devices  in  the 
proposed  3D  T/R  modules.  This  will  also  allow  close  collaboration  between  the  various  branches 
of  the  Electronics  Division.  The  combined  expertise  of  the  various  branches  will  lead  to  the 
development  of  the  best  circuit  topology  for  maximum  operational  efficiency. 


Figure  5.  Thermally  shunted  power  HBT  developed  at  AADD  of  Avionics 

Laboratory  (Ref.  14). 

Transmission  Lines 

There  are  many  transmission  line  configurations  that  can  be  used  in  the  implementation  of 
the  vertically  interconnected  3D  T/R  modules  [15,16].  Microstrip  lines,  strip  lines,  slot  lines  and 
coplanar  waveguides  are  the  best  possible  candidates  that  can  be  used  with  these  modules.  Because 
of  the  unsymmetric  nature  of  the  slot  line,  it  has  limited  applicability  and  in  general  is  used  for  line 
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couplings  between  various  layers.  In  many  cases,  circuit  planes  will  be  sandwiched  between  two 
conducting  ground  planes.  Therefore  microstrip  lines  are  replaced  by  stripline  type  transmission 
lines.  If  the  upper  dielectric  layer  of  a  circuit  plane  is  relatively  thick  compared  to  the  lower 
dielectric  layer,  microstrip  like  transmission  lines  can  still  be  implemented  with  a  slight 
modification  to  the  line  characteristic.  In  this  case,  the  strip  is  now  embedded  completely  in  a 

dielectric  material  of  er. 


Etch  #1 


Etch  #2 


Figure  6.3  Micromachining  steps  in  Silicon  for  attaching  the  HBT  chip.  The  transistor  shunt  metal 

makes  direct  contact  with  Si-substrate. 


(c)  Coplanar  Waveguide  (d)  Coupled  Coplanar  Waveguide 


Figure  7.  Transmission  line  suitable  for  3D  interconnects. 
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In  selecting  transmission  line  structures  best  suited  for  the  proposed  3D  T/R  module,  the 
lines  should  have  low  loss,  easy  to  process  and  above  of  all  they  should  be  less  sensitive  to  the 
thickness  of  the  processed  dielectric  layers.  If  the  thickness  of  the  dielectric  layer  can  be  controlled 
accurately,  the  choice  of  the  transmission  line  structure  will  not  be  critical. 

For  a  circuit  plane  sandwiched  between  two  ground  planes,  striplines  (SL)  and  coplanar 
waveguides  (CPW)  are  the  two  best  transmission  line  configurations  that  can  be  used  in 
implementing  the  passive  elements  of  a  vertically  interconnected  3D  module. 

Figure  7  shows  the  stripline  and  the  coplanar  waveguide  in  the  presence  of  both  upper  and 
lower  conducting  planes.  If  the  upper  conducting  ground  plane  is  not  present,  the  stripline 
becomes  a  modified  microstrip  transmission  lines  imbedded  in  a  dielectric  material.  Already 
successful  power  coupling  configurations  from  one  layer  to  the  other  are  implemented  when  the 
lines  are  embedded  within  multiple  dielectric  layers  [2]. 
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a  b  (a) 

h2 
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Figure  8.  Conformal  mapping  of  the  CPW  with  top  and  bottom 
ground  planes  immersed  completely  in  a  dielectric  material. 
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In  order  to  verify  the  suitability  of  CPW  for  the  proposed  3D  modules,  the  effect  of  the 
upper  and  lower  ground  planes  on  the  impedance  characteristic  of  the  coplanar  waveguide  is 
investigated.  The  geometry  given  in  Figure  8a  represents  a  typical  coplanar  waveguide  imbedded  in 
a  dielectric  material  sandwiched  between  two  conducting  planes  separated  by  distances  h,  and  h2. 
A  quasi-electrostatic  solution  for  the  CPW  can  be  found  by  using  conformal  transformations  [17- 
19].  The  geometry  given  in  Figure  8. a  is  treated  as  two  separate  problems,  one  for  the  upper  and 
one  for  lower  parts  of  the  transmission  line.  Both  parts  can  be  transformed  into  the  z-plane  through 
the  transformation 


TCX 

z  =  tanh( — ) 
2  h 


The  transformed  z-plane  coordinates  are  shown  in  Figure  8b.  The  z-plane  is  then 
transformed  into  the  w-plane  using  elliptic  transformations  (Figure  8c).  The  resulting  characteristic 
impedance  can  be  written  as 


Here 


Z„  =  60  K 


1 


( 


Kite  ,  K(k3) 


(Q) 


K(k'  2 )  K(k\ ) 


) 


(1) 


k3  =  tanh(^a/2/z,)/tanh(^f>/2/z]) 
k2  =  tanh(TO3/2/z2)/tanh(^&/2/z2) 

and  K(k)  is  the  Complete  Elliptic  Integral  of  the  First  Kind  with  argument  k  and  k'  =  V 1  —  kz . 
Since  the  whole  line  is  imbedded  in  a  dielectric  material,  the  £eff=Er. 

A  FORTRAN  program  is  written  and  the  thickness  of  the  dielectric  layers  between  the 
upper  and  lower  ground  planes  are  taken  to  be  equal.  For  a  gap  of  2  pm  and  width  of  20  pm,  the 

variation  of  the  line  impedance  as  a  function  of  the  dielectric  thickness  h  is  plotted  as  shown  in 
Figure  9.  The  impedance  is  not  effected  as  the  dielectric  thickness  becomes  greater  than  twice  the 
overall  separation  distance  of  the  two  side  ground  planes.  Therefore,  for  dielectric  layer  thickness 
grater  than  2(W+2S),  the  CPW  parameters  will  not  be  critically  dependent  on  the  small  differences 
in  the  thickness  of  the  deposited  dielectric  layers. 
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(  W=20  pm,G=2pm  h=50pm) 

Figure  9.  Impedance  variation  as  a  function  of  ground  plane  to  CPW  distance  hfor 

£=2.5. 

In  order  to  show  the  feasibility  of  utilizing  CPWs  as  distributed  circuit  elements,  a  3  section 
equal  ripple  coupled-line  filter  is  designed  and  simulated.  The  line  parameters  for  the  odd  and  even 
mode  impedance  of  the  coupled  CPW  are  calculated  beginning  from  a  low  pass  filter  prototype 
using  the  insertion  loss  method.  The  physical  dimensions  of  the  coupled  CPW  lines  are  then  found 
from  the  Line_Calc  program  of  Libra  simulation  software.  These  are  tabulated  in  Table  II  for  h=50 

pm  and  £=2.5.  The  layout  of  the  filter  is  shown  in  Figure  9a. 


Figure  9.  (a)  Layout  and  (b)  simulated  S  parameters  of  a  CPW  3-section  coupled 

equal  ripple  filter. 
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Table  II.  Parameters  of  the  coupled  coplanar  waveguide  filter 


Section 

^odd 

^even 

Gap  (C-C) 

Width 

S  (C-G) 

1 

40.2 

88.9 

10 

292.2 

24.6 

2 

47.3 

74.2 

10 

506.6 

118.5 

3 

47.3 

74.2 

10 

506.6 

118.5 

40.2 

88.9 

10 

292.2 

24.6 

Using  Libra,  the  circuit  is  simulated  and  the  resulting  IS21I  and  ISnl  are  plotted  in  Figure  9b. 
It  should  be  noted  that  coupled  CPW  line  parameters  used  in  the  Libra  simulations  are  for  a  CPW 
with  the  upper  layer  in  air. 

Dielectric  Deposition. 

One  of  the  crucial  components  of  a  vertically  interconnected  3D  system  is  the  choice, 
deposition  and  processing  of  the  dielectric  material.  There  are  many  different  dielectric  materials 
that  are  used  in  the  processing  of  semiconductor  devices.  Table  III  lists  the  properties  of  some  of 
the  dielectric  materials  used  in  the  implementation  of  3D  interconnects.  The  parameters  in  this  table 

are  for  a  50  O  microstrip  transmission  line.  Gold  metal  is  used  to  calculate  the  conductor  losses. 

.  Dielectric  materials  to  be  used  in  vertically  interconnected  microwave  and  millimeter  wave 
modules  should  have  some  stringent  mechanical,  chemical  and  electrical  properties.  These  can  be 
summarized  as: 

a)  deposition  of  uniform  and  controlled  thickness  layers 

b)  adhesion  to  metal  (gold)  surface 

c)  adhesion  of  metal  (gold)  on  the  dielectric 

d)  low  shrinkage  during  curing 

e)  compatibility  with  photoresists 

f)  prossesable  with  wet  and  dry  etching  techniques 

g)  maintain  its  properties  over  wide  range  of  temperature  operation. 

Polyimide  is  one  of  the  widely  used  dielectric  materials  in  3D  interconnect  circuitry.  In  this 
work  a  new  dielectric  material  referred  to  as  ‘Sycar’  developed  by  Hercules  Inc.  Sycar  will  be  used 
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due  to  its  excellent  moisture  resistance,  electrical  properties,  ionic  purity  and  ease  of  handling.  It  is 
applied  by  spin  coating  and  curing  at  200°  C  [17]. 


Table  II.  Properties  of  dielectric  materials  used  in  3D  interconnected  modules. 


BBC 

Si)2 

Sycar 

Polyimide 

Deposition 

Process 

Spin  coat  and 

bake 

CVD 

spin  coat  and 

bake 

spin  coat  and 

bake 

Dielectric 

Constant 

2.7 

4.0 

2.6 

3. 1-3.6 

Tan  d 

0.004 

0.03 

0.003 

0.001 

Possible 

thickness  (pm) 

26 

9 

25 

2.5-25 

Loss/A,Adielectric 

0.093 

0.712 

0.070 

0.023 

(dB)  [conductive 

0.812 

2.307 

0.838 

7.327-0.876 

It  is  very  important  that  during  the  deposition  process,  the  thickness  of  the  dielectric  layers 
should  be  controlled  to  provide  the  desired  thickness  uniformity.  Any  variations  in  the  thickness  of 
the  layers  will  prevent  successful  implementation  of  the  circuit  elements  especially  those  of  the 
distributed  line  elements  whose  parameters  are  likely  to  depend  on  the  dielectric  thickness.  As  has 
been  discussed  above,  distributed  line  elements  such  as  CPW  are  recommended  for  the 
implementation  of  the  transmission  lines  since  they  are  less  dependent  on  the  dielectric  thickness 
provided  their  lateral  dimensions  are  small  compared  to  the  thickness  of  the  dielectric  material 

Mechanical  properties  of  the  dielectric  material  also  plays  a  very  important  role  in  the 
implementation  of  the  3D  interconnects.  Shrinkage  during  processing,  stability  of  the  material, 
adhesion  to  metals  as  well  as  adhesion  of  metal  on  the  dielectric,  pealing  strength  of  the  metal  on 
the  dielectric  are  some  of  the  properties  that  have  to  thoroughly  investigated. 

Compatibility  of  photoresists  with  the  dielectric  material  and  the  type  of  etching  process 
such  as  wet  or  dry  etching  are  also  major  steps  in  the  successful  implementation  of  the  3D 
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interconnects.  It  is  possible  that  there  may  be  a  need  for  mechanical  surface  removal  and  polishing. 
Thus  the  dielectric  material  should  also  be  robust  enough  to  allow  this  type  of  abuse. 

Vertical  Interconnects 

Another  major  component  of  a  3D  T/R  module  is  signal  routing  a  microwave  or  millimeter 
waves  from  one  level  to  another.  In  some  of  the  3D  interconnects  used  in  MMIC  processing  novel 
coupling  schemes  between  various  layers  are  already  used.  If  a  complete  T/R  module  as  shown  in 
Figure  1  is  to  be  implemented,  several  ground  planes  will  separate  various  circuit  levels  from  each 
other.  Therefore,  it  will  be  necessary  to  couple  signal  from  one  layer  to  the  other  by  means  of 
vertical  posts  (vias).  IF  CPWs  are  used,  it  is  possible  that  parallel  transmission  line  TEM  modes 
may  be  also  excited  in  addition  to  CPW  TEM  modes.  To  prevent  these  unwanted  modes,  shorting 
connections  between  the  top  and  lower  ground  planes  and  the  side  ground  planes  of  the  CPW  have 
to  be  made.  These  will  also  require  additional  vertical  posts. 


Front  view 

Figure  10.  Vertical  Interconnecting  posts  between  layers.  Additional  posts  to  prevent 
possible  parallel  plate  TEM  modes  are  also  shown. 
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Figure  10  shows  the  connecting  posts  (vias)  from  one  layer  to  the  next.  Even  though  the 
posts  are  shown  as  solid  uniform  cylinders,  their  shape  may  be  intentionally  modified  to  decrease 
the  return  loss.  The  cross  section  of  the  posts  may  not  be  uniform  vertically  as  a  result  of  the 
processing  steps  in  the  electrodeposition  of  the  posts.  In  either  case,  extensive  simulations  and 
refined  processing  steps  will  be  necessary  to  find  the  best  possible  coupling  configuration.  In 
Figure  10,  additional  posts  connecting  the  ground  planes  and  the  side  ground  electrodes  of  the 
CPWs  are  also  shown. 

There  are  two  possible  processing  procedures  in  realizing  the  vertical  interconnects.  The 
first  one  is  to  initially  spin  coat  and  cure  the  dielectric  material  to  the  required  thickness.  Photoresist 
is  then  used  to  locate  the  post  locations.  Through  a  chemical  or  wet  etching  process,  the  dielectric 
material  is  removed  from  these  openings.  Then  gold  metal  is  electro-deposited  in  these  holes  to  the 
height  of  the  dielectric  layer.  Necessary  circuit  or  ground  metal  plane  is  then  deposited.  The  second 
process  involves  first  the  deposition  of  a  ground  plane  over  the  dielectric.  Using  photoresist,  the 
post  locations  and  sizes  are  exposed.  Metal  posts  are  then  electrodeposited  to  the  required  height. 
The  photo  resist  is  then  removed  and  the  dielectric  layer  is  spin  coated  and  cured.  This  process  will 
form  mesas  over  the  posts.  Thus  mechanical  lapping  may  be  required  to  planarize  the  upper  surface 
of  the  dielectric  material  to  the  depth  of  the  metal  posts.  Even  though  this  process  seems  to  be 
attractive  in  the  overall  implementation  of  the  3D  interconnects,  required  mechanical  lapping  may 
not  be  desirable. 


Modeling  and  Simulations 

In  order  to  implement  an  optimum  vertically  interconnected  3D  T/R  module,  extensive 
component  and  circuit  simulations  are  necessary. 

Even  though  circuit  simulation  software  such  as  Libra,  MDS  and  Compact  are  available, 
they  do  not  include  some  of  the  modified  circuit  topology  that  will  be  used  with  the  3D  T/R 
modules.  For  example,  the  CPW  model  used  in  Libra  is  for  a  CPW  with  the  upper  surface  exposed 
to  air.  In  the  CPWs  that  will  be  used  in  the  3D  modules,  the  electrodes  will  be  completely 
immersed  in  a  dielectric  material  and  there  will  also  be  the  upper  and  lower  ground  planes.  In  the 
quasistatic  solution  given  in  Equation  1  for  the  CPW,  both  the  dielectric  and  the  presence  of  the 
upper  and  the  lower  ground  planes  are  included.  Since  Libra  does  not  have  this  type  of  circuit 
element  in  the  Library,  either  a  new  library  element  has  to  be  added  to  the  simulator,  or  a  new 
simulation  procedure  has  to  be  developed.  This  is  also  true  for  the  coupled  CPWs.  The  filter 
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simulations  presented  in  Figure  9  are  made  using  the  conventional  CPW  models  that  are  available 
in  Libra.  The  response  of  the  filter  will  be  modified  by  the  presence  of  the  upper  ground  plane  as 
well  as  complete  immersion  of  the  CPW  in  the  dielectric  medium.  The  lack  of  these  elements  will 
generate  large  errors  in  the  simulation  results  especially  when  resonant  circuit  elements  are  used  in 
a  circuit. 

Some  of  the  simulation  software  also  lacks  detailed  device  modeling  especially  for  the 
power  devices  [19,20],  Since  temperature  effects  modify  the  operating  characteristics  of  power 
devices  such  as  HBTs,  new  nonlinear  models  has  to  be  introduced  into  the  software  or  new  device 
files  have  to  generated  and  used  with  SPICE  like  programs. 

There  are  also  many  electromagnetic  simulators  available.  Among  these,  Microwave  Lab 
and  Sonnet  are  the  two  software  which  are  exclusively  dedicated  to  modeling  planar  circuit 
elements.  CPWs  and  the  vertical  interconnects  will  be  simulated  using  these  software.  Even  though 
these  programs  require  large  computer  memory  to  ran  complicated  geometry,  they  provide  both 
electric  and  magnetic  field  distributions,  current  distributions  in  the  conductors  as  wells  the  S 
parameter  data  at  a  given  frequency.  These  are  essential  in  the  successful  implementation  of  various 
circuit  components  especially  in  finalizing  the  shapes  and  locations  of  the  interconnects  for 
minimum  return  loss. 

It  can  be  concluded  that  successful  implementation  of  3D  T/R  modules  will  require 
extensive  and  careful  device  and  circuit  modeling  and  simulations  in  addition  to  developing  novel 
processing  techniques. 


Short  Term  Goals 

The  short  term  goals  of  this  research  project  is  to  implement  a  microwave  transmitter  with 
and  integral  power  amplifier,  a  bandpass  filter  and  a  patch  antenna  similar  to  the  layer  structure 
given  in  Figure  4.  The  major  emphasis  in  this  phase  of  the  project  is  to  develop  a  reliable  and 
controllable  dielectric  deposition  process  so  that  necessary  number  of  dielectric  layers  can  be 
deposited.  The  power  amplifier  will  be  designed  to  operate  either  in  class  B  or  class  AB 
configuration  and  will  be  processed  on  a  SI  Si-substrate.  The  first  dielectric  layer  will  be  deposited 
over  the  substrate  followed  by  the  first  ground  plane.  Between  the  second  and  third  dielectric 
layers,  a  3  section  bandpass  coupled  CPW  filter  with  equal  ripple  transmission  characteristic  will 
be  processed.  In  addition  to  demonstrating  the  feasibility  of  using  CPWs,  the  filter  circuitry  will 
prevent  the  higher  harmonics  of  the  amplified  signal  to  reach  the  patch  antenna.  Over  the  fourth 
dielectric  layer,  a  patch  antenna  will  be  processed.  In  order  to  optimized  the  patch  antenna  radiation 
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characteristics,  the  thickness  of  this  final  layer  may  be  thickener  than  the  pervious  three  layers. 
Vertical  interconnects  between  the  power  amplifier  and  the  filter  and  between  filter  and  the  patch 
antenna  will  be  processed  and  optimized  for  minimum  transmission  loss.  This  circuit  essentially 
includes  all  the  major  components  and  processing  steps  necessary  in  implementing  the  final  3D  T/R 
module.  The  transmitter  will  be  experimentally  tested  in  detail  to  verify  the  various  parameters 
predicted  by  various  modeling  and  simulations. 

The  initial  phase  of  the  program  will  be  concentrated  in  the  deposition  of  the  dielectric 
layers.  The  uniformity  as  well  as  the  reproducibility  of  these  layers  are  very  important.  Thickness 
control  is  also  a  major  part  of  the  development  program.  If  a  successful  dielectric  deposition 
process  is  developed  and  the  thickness  of  the  deposited  layers  can  be  controlled  to  within  a 
micrometer,  the  ensuing  circuit  designs  will  be  relatively  simplified. 

Long  Term  Goals 

Long  term  goals  of  the  project  will  be  to  implement  complete  T/R  modules  and  test  them  in 
a  2x2  array  for  their  applicability  in  a  conformal  phased  array  antenna  system.  This  challenging 
task  will  be  achieved  through  extensive  theoretical  analysis,  simulations  and  experimental  testing  of 
various  components  and  circuit  modules. 

Conclusion 

In  this  report,  a  novel  vertically  interconnected  3D  transmit-receive  module  using  an 
integral  patch  antenna  is  proposed  for  a  conformal  phased  array  antenna  system.  Deposition  of 
uniform  dielectric  material  and  vertical  interconnects  between  various  circuit  modules  are  a  major 
part  of  the  development  program.  The  feasibility  of  the  proposed  program  will  be  demonstrated  by 
a  microwave  transmitter  using  a  four  dielectric  layers  incorporating  a  thermally  shunted  HBT 
developed  at  Wright  Laboratories.  Coplanar  waveguides  will  be  incorporated  as  the  transmission 
medium.  Extensive  theoretical  modeling  and  simulation  will  complement  the  experimental  testing 
of  the  transmitter  modules.  Emphasis  is  placed  on  designing  and  implementing  low  loss  vertical 
interconnects. 
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KINETIC  STUDIES  OF  THE  THERMAL  DECOMPOSITION 
OF  DEMNUM  AND  X-1P  USING  THE 
SYSTEM  FOR  THERMAL  DIAGNOSTIC  STUDIES  (STDS) 


Dennis  R.  Flentge 
Professor 

Department  of  Science  and  Mathematics 
Cedarville  College 


Abstract 

The  System  for  Thermal  Diagnostic  Studies  (STDS)  links  a  thermal  reactor  cell 
directly  to  a  GC/MS  for  the  examination  of  gas  phase  behavior  of  materials  as  a 
function  of  temperature,  time,  and  atmosphere.  Demnum,  a  perfluoropolyalkyl  ether, 
has  been  studied  as  a  high  temperature  lubricant  while  X-1P,  a  cyclotriphosphazene, 
has  shown  potential  as  a  vapor  phase  lubricant.  The  decomposition  of  Demnum  S- 
65  was  studied  in  the  temperature  range  of  300°C  to  750°C  and  X-1P  was  studied 
between  300 °C  and  625  °C.  Rate  constants  for  the  decomposition  were  determined 
for  each  substance. 
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KINETIC  STUDIES  OF  THE  THERMAL  DECOMPOSITION 
OF  DEMNUM  AND  X-1P  USING  THE 
SYSTEM  FOR  THERMAL  DIAGNOSTIC  STUDIES  (STDS) 

Dennis  R.  Flentge 


Introduction 

As  the  demands  for  increased  efficiency  and  greater  power  are  made  on 
turbine  engines,  the  need  for  lubricants  that  function  well  at  higher  temperatures 
has  grown.  In  addition  to  searching  for  liquids  that  are  stable  at  high 
temperatures,  scientists  and  engineers  have  been  examining  materials  that  work 
as  vapor  phase  lubricants.  Demnum,  a  perfluoropolyalkyl  ether,  has  been  studied 
as  a  high  temperature  liquid  lubricant  while  the  cyclotriphosphazene  X-1 P  shows 
potential  as  a  vapor  phase  lubricant  ( 1 ,  2).  A  knowledge  of  the  decomposition, 
reaction  rate  constants,  and  activation  energies  of  these  compounds  should 
contribute  to  the  understanding  of  the  processes  occurring  at  elevated 
temperatures. 

Methodology 

The  System  for  Thermal  Diagnostic  Studies  (STDS)  was  designed  by 
Rubey  of  the  University  of  Dayton  Research  Institute  to  study  incineration  of 
hazardous  wastes  (3,  4).  Since  the  STDS  provides  control  of  temperature, 
reactive  atmosphere,  residence  time  of  the  reactant  in  the  reaction  vessel, 
pressure,  and  degree  of  mixing,  it  is  also  an  excellent  tool  to  study  the  kinetics  of 
vapor  phase  lubricant  samples  (5). 
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The  STDS  contains  four  integrated  components.  First,  there  is  a  control 
console  that  regulates  the  temperature  of  the  reactor  cell  and  the  flow  of  the 
reactor  gas  through  it.  The  second  component,  a  thermal  reaction  compartment, 
is  a  Hewlett-Packard  model  5890A  gas  chromatograph  modified  so  that  it  can 
house  the  quartz  reactor  cell  and  the  high  temperature  furnace.  An  unmodified 
model  5890A  gas  chromatograph  is  the  third  component.  The  thermal  reaction 
products  and  unreacted  parent  material  are  swept  into  this  chamber  and  are 
deposited  near  the  beginning  of  the  chromatographic  column.  A  Hewlett-Packard 
model  5970B  mass  spectrometer  is  the  last  component  in  the  system.  Materials 
separated  by  the  gas  chromatograph  are  analyzed  by  the  spectrometer. 

Temperature,  residence  time  of  the  parent  material  in  the  reactor,  and  the 
atmosphere  in  the  reactor  were  components  of  interest  in  this  study.  The  injector 
leading  to  the  reactor  cell  was  kept  at  225°C  and  the  oven  surrounding  the  reactor 
cell  and  furnace  was  at  300°C.  The  temperature  of  the  reactor  cell,  controlled  by 
the  furnace,  was  set  at  values  ranging  from  300°C  to  750°C.  Helium  was  the 
reactor  gas  for  Demnum  S-65  while  air  was  used  for  X-1 P.  Residence  time  was 
established  by  the  flow  rate  of  the  reactor  gas. 

A  0.1  microliter  sample  of  the  neat  liquid  was  injected  into  the  reaction 
compartment  and  the  reactor  gas  carried  the  vaporized  sample  into  the  reactor 
cell.  The  flow  was  maintained  for  ten  (10)  minutes  to  insure  that  the  entire  sample 
had  time  to  be  carried  through  the  reactor  and  that  all  of  the  products  and 
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unreacted  parent  material  had  been  transported  to  the  beginning  of  the  gas 
chromatographic  column.  Then  the  reactor  gas  flow  was  reduced  and  the  helium 
flow  through  the  chromatographic  column  was  increased.  The  column  was 
flushed  two  minutes  to  carry  all  of  the  reactor  gas  through  the  gas  chromatograph 
and  past  the  detectors  of  the  mass  spectrometer.  Then  the  program  controlling 
the  temperature  increase  of  the  gas  chromatograph  and  the  detection  of  particles 
in  the  mass  spectrometer  was  started.  The  temperature  was  raised  from  50°C  to 
300 °C  at  a  rate  of  5°/minute  and  was  held  at  300°C  for  15  minutes  for  the 
Demnum  S-65  samples  and  for  10  minutes  for  the  X-1P  samples. 

Reaction  rates  were  followed  by  measuring  the  areas  of  peaks  associated 
with  fragments  of  the  molecules  under  study  or  of  their  decomposition  products. 
Parent  molecules  were  not  observed  because  their  molar  masses  are  above  the 
range  detected  by  the  mass  spectrometer  (35-800  amu). 

The  mass  spectrometer’s  sensitivity  varied  from  day  to  day.  A  sample  of 
the  parent  material  injected  into  the  reactor  set  at  300 °C  served  as  the  standard 
for  each  day’s  data.  All  data  collected  on  that  day  were  adjusted  using  the  ratio  of 
the  total  peak  area  of  the  daily  standard  to  the  peak  area  of  a  sample  measured  at 
the  beginning  of  the  data  collection  for  a  substance. 

The  chromatographic  column,  which  is  connected  directly  to  the  mass 
spectrometer,  draws  gas  into  itself  at  a  rate  that  is  independent  of  the  flow  rate  of 
the  reactor  gas.  Samples  of  1%  dodecane  dissolved  in  hexane  were  injected  at 
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several  different  reactor  gas  flow  rates.  A  plot  of  the  log  of  the  peak  area  versus 
flow  rate  gave  a  relationship  used  to  correct  the  data  for  the  effect  of  flow  rate. 

Demnum  S-65,  a  perfluoropolyalkyl  ether,  is  a  blend  of  oligimers  that  has 
an  average  molecular  weight  of  about  4500.  Its  formula  is  F-(CF2CF2CF2-0)n- 
CF2CF3.  The  average  value  of  n  is  26  but  its  value  may  range  from  less  than  1 5  to 
more  than  40.  As  a  high  temperature  lubricant  candidate,  its  thermal 
decomposition  temperature  and  the  products  formed  in  the  decomposition  are 
important. 

X-1 P  is  also  a  blend  of  several  molecules  with  the  general  formula 


N3P3Rrr>R’6  -  m>  where  m  takes  values  from  0  to  6.  R  is  the  4-fluorophenoxy  group 
and  R’  is  the  3-trifluoromethylphenoxy  group.  The  most  abundant  molecule  in  the 
mixture  has  the  molecular  formula  N3P3R2R’4.  The  mixture  contains  a  set  of 
molecules  with  formulas  that  have  100%  R  groups  (N3P3R6)  to  those  with  100%  R’ 
groups  (N3P3R’6). 

Results 
Demnum  S-65 
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Figure  1  shows  the  spectrum  of  Demnum  S-65  passed  through  the  reactor 
cell  at  300°C.  The  complex  spectrum  is  a  consequence  of  the  wide  range  of 
oligimers  present  in  the  mixture.  The  primary  mass  number  that  accounts  for  the 
spectrum  corresponds  to  a  C3F7  fragment  which  separates  from  the  parent 
molecule.  The  total  area  associated  with  this  peak  was  the  basis  for  the 
evaluation  of  kinetic  data. 


Figure  1 

Figure  2  contains  the  spectrum  of  Demnum  S-65  passed  through  the 
reactor  cell  at  750°C.  The  rate  constant  for  the  thermal  decomposition  reaction  at 
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750°C  was  determined  by  plotting  the  total  peak  area  versus  residence  time. 
Figure  3  shows  a  plot  of  the  data  that  yields  a  rate  constant  of  1 .59  s'1.  One  goal 
of  this  project  was  to  measure  the  activation  energy  for  this  reaction.  However,  a 
shift  in  the  needs  of  the  laboratory  led  to  the  kinetic  study  of  X-1 P. 


Figure  2 
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Rate  Constant  Determination  @  750  C 


Figure  3 

X-1P 

Figure  4  shows  the  spectrum  of  X-1P  passed  through  the  reactor  cell  at 
300°C.  These  peaks  represent  the  contribution  of  6  fragments  from  the  X-1 P 
molecule.  Samples  heated  to  higher  temperatures  showed  the  same  spectral 
features.  Figure  5  shows  the  plot  of  peak  area  versus  temperature.  The  shape  of 
the  curve  shows  that  a  reaction  is  occurring  at  temperatures  below  500 °C. 
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Figure  4 


Fsak  Area  versus  Temperature,  X-1P 
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Figure  5 

Rate  constant  studies  were  done  at  560°C  and  545°C.  Figures  6  and  7 
contain  the  data  gathered  in  -friose  studies.  Examination  of  the  plots  shows  a  wide 
scatter  in  the  data  points.  This  is  due  primarily  to  the  decline  in  the  performance 
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of  the  mass  spectrometer  during  August.  Efforts  made  to  compensate  for  this 
problem  were  unsuccessful. 


Rate  Constant  Determination  @  560  C 


Figure  6 


Rate  Constant  Determination  @  545  C 


Figure  7 
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Conclusions  and  Recommendations 


The  STDS  shows  excellent  potential  as  an  instrument  to  examine  the 
behavior  of  vapor  phase  lubricants.  Results  from  the  study  of  Demnum  S-65 
show  that  high  quality  kinetic  data  can  be  obtained  from  the  system.  In  addition  to 
the  ability  to  separate  and  analyze  products  that  can  be  trapped  on  a  column  at 
50 °C,  it  can  also  retain  molecules  of  lower  molar  mass  using  a  cryostatic  trap. 

This  allows  one  to  examine  the  entire  range  of  reaction  products  from  the  thermal 
decomposition  of  the  lubricants. 

The  next  step  in  this  study  is  to  measure  rate  constants  for  X-1P  at 
temperatures  below  500°C  and  determine  the  activation  energy  of  the  reaction. 
The  cryostatic  trap  could  be  used  to  identify  low  molar  mass  products  of  the 
decomposition. 

Additional  kinetic  studies  on  Demnum  S-65  can  be  completed  and 
activation  energies  calculated.  Analysis  of  low  molar  mass  components  of  the 
decomposition  may  identify  corrosive  products  that  cause  rapid  failure  in  systems 
using  Demnum  as  a  lubricant. 
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IMPROVEMENT  OF  RIGID  ROD  POLYMERS 


Himansu  M.  Gajiwala 
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Abstract 

Rigid  rod  polymers  like  PBZT,  PBZO  and  PBZI  are  known  for  their  excellent 
thermal  stability,  high  modulus  and  high  strength.  However,  one  of  the  major  drawback 
of  these  polymers  is  their  low  compressive  strength.  To  overcome  this  problem  a  novel 
approach  was  taken  and  the  new  monomer  was  synthesized.  This  monomer  was 
synthesized  by  reacting  4,4’-dibromo-2,2'-bis(4-methylbenzoxazolyl)biphenyl  with  2- 
amino-p-cresol  in  PPSE,  o-DCB  mixture  at  an  elevated  temperature.  The  bromide 
functionality  was  subsequently  replaced  by  cyano  group  and  then  hydrolysed  to  the 
carboxylic  acid.  Computer  simulation  on  this  monomer  has  indicated  the  non-planar 
geometry.  The  bulkiness  and  the  thermal  stability  of  the  side  group  is  indicative  of  the 
non  planarity  of  the  biphenyl  rings  in  thermally  treated  polymers.  This  will  result  in  3-D 
cross-linked  polymer  of  high  compressive  strength. 
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NOVEL  APPROACH  FOR  THE  COMPRESSIVE  STRENGTH 
IMPROVEMENT  OF  RIGID  ROD  POLYMERS 


Himansu  M.  Gajiwala 


INTRODUCTION 

The  rigid-rod  polybenzobisazole  polymers  that  have  received  the  greatest 
attention  in  the  literature  are  poly([benzo(l,2-d:4,5d')bisthiazole-2,6-diyl]-l,4-phenylene) 
(PBZT),  poly([benzo(l,2-d:5,4-d’)bisoxazole-2,6-diyl]-l,4-phenylene)  (PBZO)  and 
poly([  1 ,7-dihydrobenzo(  1 ,2-d:4,5-d')bisimidazole-2,6-diyl]  - 1 ,4-phenylene)  (PBZI) 
systems. 


PBZO 


PBZI 


All  these  polymers  have  been  prepared  by  the  polycondensation  reactions  in 
polyphosphoric  acid.  With  the  exception  of  PBZI,  these  polymers  can  be  obtained  as  a 
lyotropic  liquid  crystalline  solution.  All  these  rigid  rod  polymers  are  soluble  in  strong 
acids  like  methane  sulphonic  acid  (MSA),  chlorosulphonic  acid  (CSA),  sulfuric  acid,  etc. 
For  all  these  polymers,  as  their  glass  transition  temperature  is  higher  than  their 
decomposition  temperature,  they  can  be  processed  only  from  the  solution.  Processing  of 
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these  rigid  rod  polymer  solutions  from  the  liquid  crystalline  state  produces  very  high 
modulus,  high  strength  fibers  and  films.  For  PBZT  and  PBZO  polymers,  modulus  values 
of  300GPa  with  tenacity  values  of  3.0GPa  have  been  reported  on  wet-spun  fibers  after 
heat  treatment.  However,  one  of  the  major  drawback  of  these  polymers  is  their  low 
compressive  strength.  For  example,  the  synthetic  rigid  organic  polymer  like  Kevlar  49 
has  the  compressive  strength  of  365MPa  whereas  PBZT  and  PBZO  have  the  compressive 
strengths  of  270  and  200MPa  respectively. 

The  low  values  of  compressive  strengths,  in  case  of  PBZT  and  PBZO  polymers 
can  be  attributed  to  their  structural  features.  Morphology  studies  of  PBZT  and  PBZO 
fibers  spun  from  liquid  crystalline  solutions  have  shown  that  fibers  are  the  network  of 
micro  fibrils  with  no  transverse  coupling  between  the  fibrils.  This  structural  feature 
results  in  microfibrilar  buckling  when  compressed,  resulting  into  the  low  compressive 
strength.  In  the  case  of  Kevlar,  larger  force  is  required  to  observe  such  buckling 
phenomena  as  fibrils  are  interconnected  by  weak  hydrogen  bonds.  However,  PAN  based 
C-fibers  show  the  highest  compressive  strength  values  because  of  cross-linking  and/or 
interwining  of  the  ribbon  like  chain  structures.  Based  on  this  correlation,  it  can  be 
generalized  that  if  the  fibers  have  the  interconnected  network  than  higher  value  of  the 
compressive  strength  can  be  obtained. 

Cross-linking  is  one  of  the  approach  that  has  been  attempted  in  past  to  improve 
the  compressive  properties  of  PBZT  and  PBZO  fibers.  Cross-linking  was  achieved  by 
synthesizing  the  monomers  containing  thermally  labile  methyl  and/or  cyclobutane  rings. 
Copolymers  were  synthesized  using  these  modified  as  well  as  the  conventional 
monomers.  Resulting  copolymers  were  processed  initially  by  the  standard  technique 
using  the  lyotropic  liquid  crystalline  dope.  The  fibers/films  thus  obtained  were  then  heat 
treated  to  decompose  the  thermally  labile  groups  (methyl  or  cyclobutane  ring)  and  to 
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generate  highly  reactive  free  radicals.  These  free  radicals  reacted  with  each  other  and 
generated  a  cross-linked  system.  When  such  cross-linked  systems  were  tested  for 
compressive  strength,  insignificant  change  in  the  compressive  strength  was  observed. 
Morphology  studies  on  these  modified  cross-linked  copolymeric  systems  have  revealed  at 
least  three  reasons  for  the  observed  experimental  findings: 

1.  Intramolecular  cross-linking  of  the  system  in  place  of  intermolecular  cross  linking: 

This  happens  when  the  generated  free  radicals  react  intramolecularly  instead  of 
intermolecular  reaction.  For  example,  if  2,2'-dimethyi  biphenyl  moiety  is  used  for  the 
generation  of  the  free  radicals,  then  there  is  a  chance  that  the  non-planar 
dimethylbiphenyl  moiety  will  become  planar  system  as  shown  below  when  the  free 
radicals  are  generated. 


Non-planar  Planar 


If  this  type  of  reaction  occurs  than  the  system  will  not  show  intermolecular  cross- 
linking  and  the  resulting  polymer  will  not  exhibit  improved  compressive  strength. 

2.  Two  dimensional  cross-linking  instead  of  three  dimensional  cross-linking: 

Sometimes  the  polymers  containing  non-planar  monomeric  systems  containing 
the  thermally  labile  groups,  processed  from  the  liquid  crystalline  state,  react 
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intermolecularly  when  heated.  However,  thermodynamically,  the  resulting  network  is  two 
dimensional  rather  than  three  dimensional.  This  also  results  in  an  insignificant  change  in 
the  compressive  strength  of  the  polymer. 

3.  Intrafibrillar  cross  linking  in  place  of  interfibrillar  cross  linking: 

Many  times  it  has  been  observed  that  when  polymers  containing  thermally  labile 
groups  are  heat  treated  than  the  generated  highly  reactive  free  radicals  react  with  the 
system  that  is  in  close  proximity  and  results  in  the  cross-linking  within  the  fibrils.  This 
happens  when  methyl  groups  are  directly  attached  to  the  polymer  backbone.  In  these 
cases,  the  distance  of  the  free  radical  from  the  polymer  backbone  is  not  large  enough  to 
create  the  cross-linking  between  the  fibrils.  On  the  contrary,  this  type  of  system  favors  the 
intrafibrilar  cross  linking.  Thus,  once  again,  under  the  compressive  loading,  individual 
fibrils  will  take  the  load  and  will  result  in  the  buckling  failure  with  insignificant 
improvement  in  the  compressive  failure  load. 

OBJECTIVE 

The  long  term  objective  of  the  present  research  was  to  design  new  PBZT  and 
PBZO  type  of  the  polymers  having  improved  compressive  strength.  The  Reaction 
Schemes  I  and  II  were  designed  to  meet  the  above  mentioned  objective. 

RATIONALE 

The  aromatic  rings  in  the  proposed  biphenyl  monomer  are  locked  into  one 
particular  conformation  due  to  the  bulky  benzoxazole  pendant  rings.  Computer  simulation 
results  have  indicated  that  the  dihedral  angle  between  the  two  aromatic  rings  of  biphenyl 
is  ~63°  and  the  distance  of  the  methyl  groups  on  the  pendant  benzoxazole  rings  from  the 
biphenyl  backbone  is  7.37°A.  This  indicates  that  the  methyl  groups  are  far  away  from  the 
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polymer  backbone.  In  addition,  experimentally  it  has  been  found  that  the  distance 
between  two  fibrils  in  case  of  conventional  PBZT  and  PBZO  is  of  the  order  of  4  °A. 
When  methyl  groups  are  directly  attached  to  the  PBZT  or  PBZO  backbone,  the  methyl 
groups  are  at  the  distance  of  ~2°A  from  the  backbone.  This  results  in  the  intrafibrillar 
cross  linking  that  does  not  improve  the  compressive  strength  significantly.  In  the 
proposed  system,  as  the  methyl  groups  are  farther  away  from  the  backbone  (7.37°A 
instead  of  ~2°A),  this  will  help  them  to  cross-link  with  the  other  methyl  groups  of  the 
polymeric  chains  in  the  different  fibril.  This  will  help  the  interfibrillar  cross-linking 
imparting  better  compressive  strength  to  the  resultant  polymers.  In  addition,  simulation 
results  have  indicated  that  the  methyl  groups  are  in  two  different  planes  and  are  in  a 
somewhat  locked  conformation  because  of  the  bulkiness  of  the  benzoxazole  pendant 
group.  This  will  result  in  three  dimensional  type  of  cross  linking  that  in  turn  will  further 
improve  the  compressive  strength  of  the  polymer. 

EXPERIMENTAL 

4,4'-dibromodiphenic  acid  (Daychem  Inc.)  was  used  after  recrystallization  water. 
2-  amino-p-cresol  (Aldrich)  was  also  used  after  recrystallization  from  water.  Diphenic 
acid(Aldrich)  and  2-aminophenol(Aldrich),  1,2-dichlorobenzene  (O-DCB,  anhydrous, 
Aldrich),  Trimethylsilyl  polyphosphate  (PPSE,  Fluka  Chemicals)  were  used  as  received. 

4,4'  -  dibromo-2,2'  bis(4-methylbenzoxazolyl)biphenyI 

To  the  33mL  mixture  of  PPSE:0-DCB  (14mL:19mL),  2gms  (5mmole)  of  4,4’- 
dibromodiphenic  acid  and  1.23  gms  (10  mmoles)  of  2-amino-p-cresol  were  added.  The 
solution  temperature  was  raised  to  140°C  in  one  hour  with  continuous  stirring  under 
nitrogen  atmosphere.  The  solution  temperature  was  maintained  at  140°C  for  the  total  of 
16  hrs  and  then  maintained  at  160°C  for  an  additional  20  hours.  The  solution  was  then 
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cooled  to  room  temperature  and  added  to  10%  sodium  bicarbonate  solution.  The  organic 
layer  is  then  extracted  using  ethyl  acetate.  Ethyl  acetate  was  then  roto  evaporated  and  the 
product  was  precipitated  as  light  yellow  crystals  by  the  addition  of  hexane.  Required 
product  was  obtained  in  10%  yield. 

Melting  point:  166  -  168°C 

IR:  Absence  of  carbonyl  and  hydroxyl  stretch. 

NMR  in  CDC13:  8.42ppm(d,  J  2.04Hz),  7.63  ppm(d  of  d,  J  8.19Hz  and  2.1Hz), 
7.35ppm  (s),  7.18ppm  (d,  J  8.19Hz),  7.10ppm  (d,  J  8.28Hz),  7.02ppm  (d  of  d,  J  8.4Hz 
and  1.56Hz),  2.39ppm  (s) 

Mass  spectrum:  EIMS  technique,  572,  574,  576 

4,4'  -  dicyano-2,2'  bis(4-methylbenzoxazolyl)biphenyl 

A  mixture  of  4,4'  -  dibromo-2,2'  bis(4-methylbenzoxazolyl)biphenyl  (0.32gms), 
cuprous  cyanide  (0.3gms)  and  dry  N-methyi-2-pyrrolidinone  (3  ml)  was  heated  under 
nitrogen  for  22  hrs  at  170°C.  The  reaction  mixture  was  cooled  to  60°C  and  poured  into 
100ml  of  0.1%(w/v)  sodium  cyanide  solution  and  stirred  at  60°C  for  lhr.  The  resulting 
precipitate  was  filtered  and  washed  with  water  and  dried  to  get  0.32gms  of  the  required 
product. 

IR:  Showed  presence  of  cyano  group  at  2139  cm-1. 

Mass  spectrum:  EIMS  technique,  466,  334,  77 

4,4'  -  dicarboxy-2,2'  bis(4-methy!benzoxazolyl)biphenyI 

A  solution  of  0.3gms  of  4,4'  -  dibromo-2,2'  bis(4-methylbenzoxazolyl)biphenyl, 
2.5gms  of  85%  phosphoric  acid  and  0.99gms  of  phosphorous  pentoxide  was  refluxed  at 
173°C  for  24hrs.  The  solution  was  cooled  to  room  temperature  and  precipitated  in  450ml 
of  distilled  water.  The  precipitate  thus  obtained  was  filtered,  washed  with  water  and 
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dried.  The  product  was  dissolved  in  10%  potassium  hydroxide  and  the  solution  was 
acidified  to  pH  2  using  hydrochloric  acid  to  get  green  color  precipitate.  The  product  was 
filtered,  washed  with  water  and  dried. 

IR:  Absence  of  cyano  group  peak  and  the  presence  of  peak  at  1710cm'1  for  the  carbonyl 
group  stretch  along  with  hydroxyl  group  broad  peak  in  the  range  of  2400-3500cm*1. 

Mass  spectrum:  EIMS  technique,  504,  459,  372, 77 

4,4'-biphenyldicarbonyl  chloride 

Solution  mixture  of  25gms  of  4,4'-biphenyldicarboxylic  acid,  150ml  of  thionyl 
chloride  and  four  drops  of  dimethylformamide  was  refluxed  for  24  hrs.  Excess  thionyl 
chloride  was  distilled  off  using  70  ml  of  toluene  as  the  chaser  solvent.  Toluene  was 
distilled  till  solution  started  forming  the  crystals.  The  solution  was  then  cooled  in  ice  and 
the  clear  toluene  solution  was  decanted.  The  crystals  were  once  again  recrystallized  using 
200ml  of  fresh  toluene.  HPLC  of  the  crystals  indicated  only  one  component  as  expected. 

4,4'-dibenzoxazolebiphenyl  (Model  compound  synthesis) 

Solution  of  4,4'-biphenyldicarbonyl  chloride  (lgm)  and  2-aminophenol  (0.7819 
gms)  in  83%polyphosphoric  acid  (12.4  gms)  was  swirled  at  160°C  under  nitrogen 
atmosphere  for  3  days.  The  solution  was  then  precipitated  on  ice  and  filtered  and  washed 
with  -1000ml  of  hot  water  and  finally  with  acetone  and  dried  to  get  yellowish  white 
granular  material  in  75%  yield. 

Melting  point:  (crude  sample)  350  -365°C 
IR:  1684cm*1, 1605cm*1, 1293cm*1, 

Mass  spectrum:  EIMS  technique  -  388,  296 
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Polybenzoxazole  from  4, 4'-biphenyldicarbonyI  chloride  (Model  polymer  synthesis) 

Measured  amounts  of  4,4'biphenyldicarbonyl  chloride  (2gms)  was  mixed  with 
diaminodithiophenol  (1.7569gms)  in  77%  polyphosphoric  acid  (16.2264gms).  The 
solution  was  slowly  swirled  under  nitrogen  purge  at  60°C  for  2  days.  Additional  amount 
of  phosphorous  pentoxide  (5.7269  gms)  was  added  to  make  83%  polyphosphoric  acid. 
The  solution  temperature  was  then  raised  to  1 00°C  and  swirled  under  nitrogen  overnight. 
Solution  temperature  was  then  raised  to  197°C  over  a  period  of  2  days  and  maintained  at 
that  temperature  for  an  additional  2  days.  The  solution  was  then  cooled  and  precipitated 
on  700  ml  of  ice  cold  water.  The  precipitate  was  filtered,  washed  and  dried  to  get  3.1 
gms  of  the  light  green  color  product.  The  precipitate  was  further  washed  in  soxhelt 
extractor  for  an  additional  two  days  to  get  2.5  gms  of  the  green  color  product. 

Discussion 

In  order  to  synthesize  4,4'  -  dibromo-2,2'  bis(4-methylbenzoxazolyl)biphenyl 
strong  but  neutral  dehydrating  agent  PPSE  was  selected  to  prevent  the  possible 
intramolecular  dehydration  reaction.  However,  with  this  reaction,  the  yield  of  the  required 
product  was  very  low  and  to  get  very  pure  product  column  chromatography  was  essential. 
The  NMR  spectrum  as  well  as  IR  and  mass  spectroscopy  confirmed  that  the  required 
product  was  obtained  by  this  technique.  To  improve  the  yield,  several  experiments  were 
carried  out  by  varying  several  different  reaction  parameters.  However,  significant 
improvement  in  the  yield  was  not  observed.  To  overcome  this,  it  was  decided  to  use 
polyphosphoric  acid  as  the  dehydrating  agent.  When  polyphosphoric  acid  was  used, 
following  reaction  product  was  obtained  in  nearly  quantitative  yield. 
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Structure  of  this  yellowish  green  color  crude  product  as  well  as  the  bright  yellow 
colored  NMP  recrystallized  product  was  confirmed  by  IR  and  Mass  spectrum.  Mass 
spectrum  showed  peaks  at  467,  469  and  471  corresponding  to  M+.  DR.  shows  a  strong 
carbonyl  group  peak  at  1722cm'1.  The  recrystallized  product  had  the  melting  point  of 
272  -  272.5°C.  These  results  indicated  that  the  required  product  can  only  be  obtained  by 
the  use  of  PPSE. 

The  product  obtained  by  the  use  of  PPSE  was  then  treated  with  cuprous  cyanide  to 
replace  the  bromine  with  cyano  group.  IR  and  mass  spectroscopy  confirmed  the  structure 
of  the  expected  dicyano  product.  The  dicyano  product  was  then  hydrolyzed  using 
phosphoric  acid.  Once  again  the  structure  of  the  required  product  was  confirmed  by  DR 
and  mass  spectroscopy. 

In  order  to  study  the  polymer  made  out  of  this  novel  system,  it  was  essential  to 
study  the  basic  polymer  system  made  out  of  4.4'-biphenyl  dicarboxylic  acid.  To  judge  the 
reactivity  of  this  compound,  it  was  decided  to  make  the  model  compound  using  4,4’- 
biphenyl  dicarbonyl  chloride  and  2-aminophenol.  The  reaction  was  carried  out  as 


20-11 


indicated  in  the  experimental  section  and  the  results  indicated  that  the  reaction  is  feasible. 
The  structure  of  the  product  was  confirmed  by  IR  and  Mass  spectroscopy. 

The  basic  polymeric  system  was  tried  using  4,4'-biphenyldicarbonyl  chloride  and 
diaminodithiophenol.  The  reaction  product  indicated  that  the  reaction  is  feasible  and 
further  characterization  study  is  needed  to  confirm  the  structure  as  well  as  the  properties 
of  the  resultant  polymer. 

Thus,  these  reaction  sequences  indicated  that  the  new  type  of  dicarboxylic  acid 
can  be  synthesized  and  it  can  be  utilized  for  the  synthesis  of  novel  polybenzoxazole  and 
polybenzthiazole  that  can  exhibit  better  compressive  strength. 
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Reaction  Scheme  1:  Monomer  synthesis 
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Reaction  Scheme  2:  Polymerization  reactions 


2.  Copolymer  using  terephthaloyl  chloride  as  the  third  monomer. 


3.  Homo  and  Copolymer  of  benzoxazole  type. 

4.  Homopolymers  using  4,4'-biphenyl  dicarboxylic  acid. 
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Figure  1:  Computer  simulation  of  the  model  compound  showing  non-planar  biphenyl  unit. 
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Figure  5:  FTIR  spectrum  of  the  cmde  4,4'-dicarboxy-2,2’-bis(4-methylbenzoxazolyl)biphenyl. 
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ABSTRACT 


Defense  system  requirements  have  shifted  from  performance  at  all  costs,  to  one  of  ensuring 
superiority,  yet  being  affordable.  As  a  result,  affordability  has  become  the  primary  focus  of  weapon 
system  procurement.  In  order  to  develop  affordable  products,  product  design  and  manufacturing 
process  design  must  be  considered  concurrently  and  early  in  the  design  phase.  One  approach  to 
developing  affordable  systems  is  to  combine  the  integrated  product/process  development  (IPPD) 
process  and  virtual  manufacturing  (VM),  or  "manufacture  in  the  computer."  However,  if  product  and 
process  design  issues  are  to  be  effectively  considered,  then  cost,  schedule,  and  risk  assessment  tools 
must  be  sensitive  to  changes  in  the  product's  physical  design  as  well  as  to  changes  in  the  manufacturing 
system.  Unfortunately,  such  tools  are  not  available  today.  However,  this  deficiency  provides  an 
opportunity  to  totally  rethink  or  "re-engineer/re-design"  manufacturing  cost  estimating  methodologies. 

But,  before  "designing"  and  developing  cost  estimating  methodologies  that  adequately  address 
affordability  issues,  effectively  support  the  IPPD  process,  relate  product  and  process  variables,  operate 
in  a  virtual  design  environment,  and  model  manufacturing  and  non-manufacturing  indirect  activities, 
the  progression  of  modeling,  analysis  and  costing  —  from  the  individual  features  of  the  product  being 
designed  to  its  impact  on  the  enterprise  —  needs  to  be  understood.  This  research  provides  a  framework 
that  defines  the  modeling  and  analysis  progression  from  product  features  to  an  enterprise  view  of  the 
production  of  the  product.  The  framework  provides  the  foundation  for  manufacturing-oriented,  design- 
directed  cost  estimating  (MODDCE). 
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A  FRAMEWORK  FOR  MANUFACTURING-ORIENTED,  DESIGN-DIRECTED 

COST  ESTIMATION 


Allen  G.  Greenwood 


INTRODUCTION 

In  the  21**  Century,  technologically  superior  Air  Force  weapons  systems  will  be  produced  for  half  of  the  cost  and 
fielded  in  half  the  time  through  a  fully  integrated  commercial,  military  and  organic  industrial  base.  The  existing 
fleet  will  be  upgraded  and/or  modified  to  maintain  its  superiority  with  the  same  reductions  in  cost  and  time. 

([1],  p.  i)  The  environment  for  today's  cost  estimator  and  analyst  is  certainly  very  challenging.  Computerization, 
software,  robots,  composites,  uncertainty,  and  integrated  systems  all  challenge  the  applicability  of  our  existing 
tools  and  techniques.  (R.  R.  Crum,  [2]) 

In  most  commercial  and  defense  industries  today,  the  focus  is  on  affordability.  Affordability  is 
defined  as  the  balance  among  requirements,  costs,  and  budgets  and  considers  both  product  performance 
and  cost.  A  technology  is  considered  affordable  if  it  meets  the  customer  s  requirements,  is  within  the 
customer's  budget,  and  has  the  best  value  among  available  alternatives  ([3],  p.  2).  This  paper  provides  a 
framework  for  addressing  both  product  and  process  cost  issues,  during  the  design  phase.  Before 
defining  the  framework,  this  section  briefly  provides  the  background,  context,  and  motivation  for  this 
research,  in  terms  of  the  current  military  design  and  manufacturing  environment  &nd  state  of  the  art  in 
manufacturing  cost  estimating. 


The  Military  Design  and  Manufacturing  (and  re-manufacturing)  Environment 

Any  management  system  is  affected  by  both  a  general  environment  and  an  operating 
environment.  The  general  environment  includes  basic  economic,  social,  political,  and  legal  pressures 
that  are  brought  to  bear  upon  the  firm  or  industry.  More  specifically,  companies  and  industries  must 
function  in  an  operating  environment  that  includes  customers,  suppliers,  competition,  government, 
technology  (both  internal  and  external)  and  the  organization  or  management  system  itself.  One  of  the 
primary  characteristics  of  today's  general  business  environment  is  change.  The  defense  industry,  like 
many  others,  is  experiencing  a  period  of  rapid  change.  The  change  in  the  DoD  s  missions  and  needs  is 
driven  by  a  shifting  and  uncertain  threat  environment,  e.g.  collapse  of  the  Soviet  Union,  strong 
downward  fiscal  pressures.  As  a  result,  affordability  has  become  the  primary  focus  of  weapon  system 
procurement.  Defense  system  requirements  have  shifted  from  performance  at  all  costs,  to  one  of 
ensuring  superiority,  yet  being  affordable.  DoD  is  adopting  a  more  balanced  cost  of  performance  view 
where  weapon  system  life  cycle  cost  is  viewed  as  an  independent  variable,  not  simply  fall-out  as  a 
dependent  variable."  [4] 

The  focus  on  affordability  has  led  to  widespread  acquisition  reform,  utilization  of  integrated 
product  teams  (IPTs),  application  of  an  integrated  product/process  development  (IPPD)  process  or 
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concurrent  engineering,  adoption  of  commercial  practices  in  military  production,  military  production  on 
commercial  lines,  emphasis  on  sustainment  (stretching  the  life  cycle  of  weapon  systems  through 
modification  and  re-manufacture),  etc.  Few  new  major  weapon  system  programs  are  expected  in  the 
foreseeable  future  and  those  programs  that  are  developed  will  mostly  be  procured  in  low  production 
volume  and  at  a  low  production  rate;  in  some  cases,  e.g.  missiles,  there  will  be  low  volume  and  a  high 
product  mix.  Many  weapon  system  components  have  no  commercial  counterpart  and,  due  to  the  highly 
complex  nature  of  the  components  and  their  low  production  volumes,  many  of  the  military's  needs  can 
not  "compete"  in  the  broader  global  market.  Obviously,  these  current  find  future  trends  all  present 
major  challenges  to  the  design  and  manufacture  of  superior  yet  affordable  defense  systems. 

Affordability,  a  key  metric  in  defense  system  acquisition  and  operation,  considers  both  cost  and 
performance  and  is  obtained  and  maintained  by  (1)  identifying  and  assessing  cost  drivers  early  in  the 
life  cycle,  (2)  considering  process  design  and  manufacturing  operations  early  in  the  product  life  cycle, 

(3)  effectively  linking  product  and  process  design,  especially  in  terms  of  cost,  schedule,  and  risk,  and 

(4)  managing  cost  and  risk  over  the  system's  entire  life  cycle.  While  affordability  assessments  need  to 
address  all  phases  of  the  product's  life  cycle,  and  the  costs  that  will  be  incurred  in  those  phases,  i.e.  life 
cycle  cost,  this  project  focuses  only  on  the  costs  incurred  in  the  production  phase  and  the  design  and 
operation  of  manufacturing  systems. 

One  vehicle  for  addressing  affordability,  and  other  issues,  is  through  IPTs  and  the  LPPD  process. 
The  IPTs  must  concurrently  consider,  either  in  person  or  over  computer  networks,  product  design  and 
manufacturing  process  design  in  terms  of  product  performance  and  cost  tradeoffs.  The  largest  payoffs 
result  from  effectively  utilizing  EPTs  as  early  as  possible  in  the  design  phase.  The  importance  of  early 
design  decisions  is  widely  recognized.  It  is  often  stated  that  roughly  70  percent  of  the  total  life  cycle  cost 
of  the  system  is  determined  during  conceptual  design  [5]. 

The  relationships  between  the  business  and  operating  environment,  the  IPPD  process,  the 
concurrent  consideration  of  product  and  process  design,  and  the  tradeoffs  between  cost  and 
performance  is  represented  graphically  in  Figure  1. 

The  cost/ performance  tradeoff  graphic  at  the  bottom  of  Figure  1,  a  variation  on  a  concept  put 
forth  by  Yoshimura  [6],  depicts  the  goal  of  design  as  seeking  higher  product  performance  and  lower 
process  cost;  ie.,  driving  the  design  toward  the  "efficient  frontier."  The  efficient  frontier,  the  edge  of  the 
feasible  region  in  Figure  1,  contains  the  Pareto  optimum  design  solutions  —  those  solutions  where  there 
exists  no  other  feasible  design  solution  that  will  yield  an  improvement  in  one  objective  without  causing 
degradation  in  the  other  objective.  Cost  estimating  methodologies  are  generally  grouped  into  four 
categories  [7]:  (1)  judgment  -  use  of  expert  opinion  of  one  or  more  qualified  experts  in  the  area  to  be 
estimated,  (2)  parametric  —  mathematical  expressions,  often  referred  to  as  cost  estimating  relationships 
or  CERs,  that  relate  cost  to  independent  cost  driving  variables,  (3)  analogy  -  use  of  actual  costs  from  a 
similar  existing  or  past  programs  with  adjustments  for  complexity,  technical  or  physical  differences,  to 


21-  4 


derive  a  new  system  estimate,  and  (4)  grass  roots,  also  referred  to  as  "engineering  build  up"  or  "detailed 
estimate"  are  performed  at  the  functional  level  of  the  Work  Breakdown  Structure. 


Figure  1:  Affordability  is  the  driving  force  in  today's  defense  manufacturing  environment. 


One  way  to  develop  affordable  products  is  through  virtual  manufacturing  (VM),  or 
"manufacture  in  the  computer."  VM  is  one  of  the  key  technologies  which  allows  defense  manufacturers 
to  go  beyond  the  assumptions  driving  the  historic  acquisition  process  because  it  provides  four 
fundamental  changes  for  defense  manufacturing  [8]:  (1)  VM  can  be  used  to  prove  the  production 
scenarios,  resulting  in  "pre-production  hardened  systems";  (2)  VM  can  support  the  generation  of  more 
reliable  estimates  of  production  costs  and  schedule  because  the  models  are  based  on  actual  processes, 
not  just  parametrics;  (3)  modeling  and  simulation  can  significantly  improve  production  flexibility,  hence, 
reducing  the  fixed  costs;  and,  (4)  reliable  predictions  of  costs,  risk,  and  schedule  can  substantially 
improve  the  decision  making  process  of  acquisition  managers. 

In  this  new  and  evolving  technology,  fabrication  and  assembly  of  any  product,  including  the 
associated  manufacturing  processes  and  all  variables  in  the  production  environment  from  shop  floor 
processes  to  enterprise  transactions,  are  modeled  and  simulated  in  an  integrated  computer  environment. 
VM  accommodates  the  visualization  of  interacting  production  processes,  process  planning,  scheduling, 
assembly  planning,  logistics  from  the  line  to  the  enterprise,  and  related  impacting  processes  such  as 
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accounting,  purchasing,  and  management  [9].  The  move  from  solid  models  to  virtual  environment 
enables  manufacturing  and  industrial  engineers  to  design,  test,  and  modify  manufacturing  processes 
without  physical  prototypes  or  production  interruptions.  It  allows  integrated  product  and  process 
development  (IPPD)  in  a  virtual,  3-D  world  that  is  understood  by  all  participants  in  the  product  and 
process  design.  ([10],  pp.  35-36) 

In  order  for  IPTs  to  effectively  consider  product  and  process  design  issues,  they  need  cost, 
schedule,  and  risk  assessment  tools  that  are  sensitive  to  changes  in  the  product's  design,  as  well  as  to 
changes  in  the  design  of  the  manufacturing  system.  Unfortunately,  such  tools  are  not  available  today.  A 
review  by  Thomas  [11]  of  papers  published  in  major  journals  in  the  past  five  years,  resulted  in  the 
following  conclusions:  (1)  research  in  manufacturing  cost  estimation  has  involved  rather  few  people  and 
as  a  result  there  are  no  broadly  applicable  mathematical  models,  (2)  there  is  widespread  dissatisfaction 
with  the  way  overhead  or  indirect  costs  are  handled,  and  (3)  the  lack  of  broadly  applicable  cost  models 
leads  to  few  efforts  directed  towards  their  integration  into  the  CAD/IPPD  systems. 

Many  cost  models  do  not  provide  the  capability  to  perform  meaningful  product/process  trades 
and,  in  fact,  may  lead  to  an  inefficient  improvement  focus.  For  example,  many  existing  cost  models  are 
driven  by  parameters  such  as  weight.  As  Dean  [12]  points  out,  "there  are  many  perceptions  of  cost 
which  people  believe  to  be  true  and  act  upon,  but  which  leads  us  in  the  wrong  direction.  ...  The  weight- 
based  cost  estimating  relationship,  as  are  many  other  common  perceptions  of  cost,  is  quite  misleading 
unless  fully  understood  from  a  cost  perspective  and  an  engineering  perspective."  This  is  illustrated  in 
Figure  2,  using  data  from  the  Manufacturing  2005  Program  ([13],  [14]).  In  this  case,  if  a  design  goal  is  to 
reduce  manufacturing  cost  of  the  horizontal  stabilizer  and  if  CERs  that  are  based  on  weight  are  used  in 
the  analysis,  then  the  design  focus  would  be  on  the  stabilizer's  skins.  As  seen  in  the  table  below,  each 
skin  comprises  approximately  one  third  of  the  weight  of  the  structure  and  accounts  for  about  one  fifth  of 
the  cost.  However,  the  ribs  account  for  more  than  one  third  of  the  manufacturing  cost  but  comprise  less 
than  ten  percent  of  the  weight. 

Based  on  the  research  conducted  during  this  project,  the  author  adds  the  following  issues  to  the 
list  of  concerns  with  the  current  state  of  manufacturing  cost  estimating  in  the  IPPD  process:  (1)  Cost 
model  drivers  need  to  be  more  closely  related  to  the  characteristics  of  manufacturing  processes  and 
operations.  The  process  needs  to  be  designed  as  much  as  the  product;  all  too  often  the  product  is 
designed  and  then  the  process  (manufacturing)  must  be  designed  to  accommodate  it  or  the  product 
must  be  redesigned  to  accommodate  the  process.  (2)  Cost  model  drivers  need  to  be  more  closely  related 
to  product  features.  (3)  Manufacturing  process  design  and  resulting  costs  need  to  be  considered  earlier 
in  the  design  of  the  product.  (4)  Product  and  process  design  must  be  sensitive  to  the  existing  or  planned 
manufacturing  state,  especially  in  an  agile  environment;  i.e.  a  design  may  change  based  on  current 
factory  utilization.  Cost  models  need  to  capture  the  dynamics  and  stochastic  nature  of  manufacturing. 
(5)  The  relationship  between  product/process  costs  and  indirect  costs  need  to  be  better  defined  and 
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understood.  Indirect  costs,  both  manufacturing  and  non-manufacturing,  need  to  be  "modeled"  so  that 
they  can  be  addressed  during  design.  (6)  Legacy  cost  system  data  are  often  not  tracked  at  the  level  of 
detail  needed  in  design.  (7)  Legacy  cost  system  data  are  often  based  on  processes  and  methods  that  are 
no  longer  used. 
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Figure  2:  Many  weight  -driven  cost  models  do  not  permit  meaningful  product/ process  trades. 


Despite  the  shortcomings  of  past  and  current  cost  estimating  methodologies  and  the  lack  of 
perceived  value  that  they  provide  to  the  IPPD  process,  there  are  research  efforts  underway  that  are 
attempting  to  mitigate  these  problem;  some  of  those  efforts  will  be  identified  later  in  this  report.  It  seems 
that  if  we  can  model  and  simulate  the  operation  and  behavior  of  a  product  and  process,  we  certainly 
should  be  able  to  model  and  simulate  a  product/process's  cost  behavior.  In  fact,  VM  provides  an 
opportunity  to  "re-engineer"  costing  methodologies.  We  need  to  freshly  examine  how  costing  should  be 
done  in  a  virtual  environment,  unencumbered  by  traditional  accounting  practices.  New  costing 
methodologies  need  to  be  designed  to  fit  into  tomorrow's  environment. 

But,  before  "designing"  and  developing  cost  estimating  methodologies  that  adequately  address 
affordability  issues,  effectively  support  the  IPPD  process  and  relate  product  and  process  variables, 
operate  in  a  virtual  design  environment,  and  model  manufacturing  and  non-manufacturing  indirect 
activities,  we  need  to  understand  the  progression  of  modeling,  analysis  and  costing  from  the  individual 
features  of  the  product  being  designed  to  its  impact  on  the  enterprise.  The  result  of  this  research  study  is 
a  framework  that  defines  the  modeling  and  analysis  progression  from  product  features  to  an  enterprise 
view  of  the  production  of  the  product.  The  framework  provides  the  foundation  for  manufacturing- 
oriented,  design-directed  cost  estimation  (MODDCE). 

THE  MODDCE  FRAMEWORK 

An  overview  of  the  manufacturing-oriented,  design  directed  cost  estimating  (MODDCE) 
framework  that  was  developed  as  part  of  this  research  study  is  provided  in  Figure  3.  The  framework  is 
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composed  of  five  levels  that  progress  from  a  hierarchical  decomposition  of  a  product's  features  through 
three  intermediate  steps  to  an  enterprise-level  model  that  links  product  features  with  manufacturing 
and  non-manufacturing  entities  throughout  the  enterprise.  The  highest  or  enterprise  level  provides  the 
most  comprehensive  analysis  capabilities  for  assessing  the  effect  of  changes  in  product  and  process 
design  on  the  enterprise  and  the  resulting  effect  of  those  changes  on  "total"  cost.  The  progression  to 
each  level  requires  the  completion  of  an  activity.  For  example,  in  order  to  move  from  the  hierarchical 
decomposition  of  a  product's  features  to  the  corresponding  manufacturing  elements  that  create  those 
features,  requires  an  activity  that  maps  or  associates  features  with  transformation  elements.  Each  of  the 
activities  that  link  the  five  levels  are  described  in  the  following  sub-sections. 


Figure  3:  MODDCE  framework  provides  the  basis  for  "designing"  and  integrating  costing 
methodologies  that  link  product  features  to  an  enterprise  view  of  the  production  of  the  product. 


A  preliminary  and  partial  data  model  for  implementing  the  MODDCE  framework  is  provided 
in  Figure  4.  The  data  model  is  described  in  the  framework  subsections  that  are  most  closely  related  to 
each  data  model  entity.  As  a  matter  of  notation,  lines  with  no  double-headed  arrows  indicate  a  one-to- 
one  relationships  between  entities,  a  double-headed  arrow  on  one  end  represents  a  one-to-many 
relationship,  and  arrows  on  both  ends  represents  a  many-to-many  relationship.  For  example,  there  is  a 
many-to-many  relationship  between  suppliers  and  items  -  an  item  may  be  obtained  from  multiple 
suppliers  and  a  supplier  may  provide  many  items  to  the  company.  Also,  there  is  an  implied  inheritance 
of  information  from  entity  to  entity,  as  one  moves  from  left  to  right  in  Figure  4;  i.e.,  a  manufacturing 
process  object  inherits  all  of  the  information  about  manufacturing  operations,  manufacturing  elements, 
and  product  feature  characteristics. 
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Figure  4:  Preliminary  and  partial  data  model  for  the  MODDCE  framework. 


Mapping  product  features  to  manufacturing  elements 

The  process  of  mapping  product  features  to  manufacturing  process  elements  is  illustrated  in 
Figure  5.  A  product  is  broken  down  into  assemblies  which  are  further  broken  down  into  items.  This 
breakdown  is  represented  in  the  data  model  in  Figure  4  as  a  many-to-many  relationship  between 
products  and  assemblies  and  another  many-to-many  relationship  between  assemblies  and  items.  Items 
are  the  lowest  level  of  product  definition  that  is  addressed  in  the  framework  and  are  considered  to  be 
either  fabricated  or  purchased.  The  example  in  Figure  5  shows  an  assembly  named  "widget"  that  is  a 
part  of  a  higher-order  assembly  and  is  composed  of  four  items,  two  fabricated  items  (strap  and  slab)  and 
two  purchased  items  (block  and  fastener).  Purchased  items  do  not  have  features;  their  only 
characteristic  is  a  part  number.  The  part  number  links  the  item  to  a  data  base  that  includes  such 
information  as  description,  physical  characteristics,  supplier  information,  etc.  Each  fabricated  item 
requires  two  types  of  information:  basic  data  on  the  item  —  such  as  part  number,  base  material,  size,  etc. 
-  and  a  list  of  features  and  a  corresponding  set  of  information  on  each  feature.  For  example,  in  Figure  5, 
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the  fabricated  item  "slab"  has  three  features,  two  holes  and  one  surface  characteristic.  Hole  features 
would  contain  such  information  as  diameter,  depth,  tolerance,  location,  etc.  and  surface  features  would 
contain  such  information  as  the  face  that  is  treated,  purpose  of  treatment,  material,  etc. 

Once  all  of  the  items  have  been  defined  and  their  features  specified,  a  manufacturing  element 
that  creates  the  feature,  either  by  transforming  the  shape  of  the  item  or  altering  its  properties  (non¬ 
shaping),  is  specified.  There  is  a  one-to-one  relationship  between  a  feature  and  an  element.  Elements  are 
specified  from  a  taxonomy  of  manufacturing  processes,  e.g.  [15].  The  element  specification  may  be  for  a 
specific  manufacturing  process  or  for  a  more  general  process  family.  The  choice  of  a  manufacturing 
process  is  dictated  by  various  considerations  ([16],  p.  1232):  characteristics  and  properties  of  the 
workpiece  material,  shape,  size,  and  thickness  of  the  part,  tolerances  and  surface  finish  requirements, 
functional  requirements  for  the  expected  service  life  of  the  item,  production  volume  and  schedule,  level 
of  automation  required  to  meet  production  volume  and  rate,  costs  involved  in  individual  and  combined 
aspects  of  the  manufacturing  operation.  Numerous  guidelines  are  available  on  the  capabilities  of  various 
manufacturing  processes  for  producing  items  from  different  materials,  shape  characteristics,  tolerance 
and  shape  characteristics,  etc.  Some  guidelines  are  incorporated  into  CAD  software  to  provide  real-time 
feedback  to  the  designer.  Cost  guidelines  are  also  available,  usually  on  a  relative  basis.  All  too  often 
these  guidelines  represent  general  trends  and  are  not  related  to  specific  processes  and  product  features. 
Few  guidelines  are  available  on  the  cost  impact  of  production  volume,  rate,  lead  time,  etc.  These  are 
factors  that  are  becoming  increasingly  important  to  address  during  design  especially  when  fewer  new 
defense  systems  are  being  procured  and  there  is  a  growing  need  to  sustain  and  re-manufacture  systems 
that  are  no  longer  being  procured. 

Using  the  Todd  et  al.  taxonomy  [15],  a  hole  requires  a  shaping  manufacturing  process  that  could 
be  either  mass  reducing  or  mass  conserving.  If  a  hole  is  created  by  mechanical  mass  reduction,  further 
specification  would  require  consideration  of  manufacturing  processes  that  are  chip-oriented  (e.g. 
drilling)  or  shear-oriented  (e.g.  blanking).  Alternatively,  a  hole  could  be  created  through  a  mass 
conservation  process  which  may  involve  a  consolidation  process,  such  as  casting,  or  a  deformation 
process,  such  as  forging.  As  shown  in  Figure  5,  for  the  widget  example,  the  hole  features  in  the  slab  and 
straps  are  specified  very  generally;  they  are  only  identified  as  being  mass  reduction  elements.  On  the 
other  hand,  the  surface  property  of  the  slab  in  Figure  5  is  specified  more  precisely  as  being  a  charge- 
transferred  spray  coating.  Note  that  at  this  level  of  analysis,  even  if  the  methodology  used  to  create  the 
feature  is  precisely  specified,  such  considerations  as  setup,  simultaneous  operations,  resource 
availability,  etc.  are  not  considered. 

Specification  of  manufacturing  elements  provide  the  first  opportunity  for  cost  assessment. 
However,  in  order  to  provide  a  cursory  assessment  of  cost,  a  default  manufacturing  process  would  have 
to  be  associated  with  each  element,  e.g.  the  manufacturing  process  drilling  could  be  the  assumed 
process  used  to  create  a  hole.  Since  each  element  is  discrete  and  disjoint,  it  is  more  appropriate  to  begin 
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costing  at  the  next  level,  the  operations  level.  The  transition  to  the  operations  level  requires  combining 
elements  related  to  each  feature  into  manufacturing  operations;  this  activity  is  described  in  the  following 
section. 


Combining  manufacturing  elements  into  operations 

The  second  activity  combines  manufacturing  elements  for  each  feature,  as  described  above,  into 
manufacturing  process  operations.  An  illustration  of  this  activity  using  the  widget  example  is  provided  in 
Figure  6.  The  oval-shaped  symbols  in  the  figure  represent  manufacturing  operations.  As  can  be  seen  in 
Figure  6,  the  strap  item  is  produced  by  stamping  -  the  base  material  is  cut  to  size  and  the  two  holes  are 
formed  in  one  step  using  the  stamping  process.  Note  that  the  creation  of  several  feature  elements  have 
been  combined  in  order  to  form  the  stamping  operation  and  that  the  manufacturing  process  for  creating 
the  features  has  become  more  specifically  defined,  from  the  general  mass  reduction  process  family 
specified  for  the  manufacturing  elements  to  the  particular  process,  stamping,  specified  as  a  process 
operation.  It  is  at  this  level  that  specific  equipment,  labor  skill  requirements,  and  tooling  are  designated, 
as  illustrated  in  the  data  model  in  Figure  4,  where  the  manufacturing  operations  are  linked  to 
equipment,  labor,  and  tooling  data  files.  It  is  also  at  this  level  where  precedence  among  elements  and 
operations  is  defined.  The  dashed  lines  in  Figure  6  indicate  precedence;  for  example,  for  the  slab  item, 
the  base  material  is  cut  first,  then  painted,  and  finally  the  holes  are  created  through  drilling. 
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Based  on  the  research  conducted  during  this  study,  this  level  of  analysis  is  the  current  state  of 
the  art  in  cost  estimating  methodologies  that  directly  link  product  features  to  process  characteristics. 
Development  of  this  capability  is  a  part  of  two  JSF/JAST  projects.  The  Joint  Strike  Fighter  0SF)  program, 
formerly  the  Joint  Advanced  Strike  Fighter  (JAST)  program,  is  chartered  to  develop  an  affordable  next- 
generation  strike  weapon  system  for  the  Navy,  Marine  Corps,  Air  Force,  and  our  allies.  Both  projects  - 
JMD,  JAST  Manufacturing  Demonstration  (prime  contractor:  Hughes  Aircraft  Company)  and  SAVE, 
Simulation  Analysis  Validation  Environment  (prime  contractor:  Lockheed  Martin  Tactical  Aircraft 
Systems,  LMTAS)  -  link  feature-based  CAD  software  ( Pro/E  and  CATIA,  respectively)  to  Cognition 
Corporation's  Cost  Advantage  (CA)  software  system.  CA  is  a  design  for  manufacturing  expert  system 
that  provides  design  guidance,  manufacturing  alternative  analysis,  produdbility  analysis,  and  predictive 
cost  analysis.  CA  uses  process  models  that  are  comprised  of  information  and  rules  to  estimate  costs  and 
advise  engineers  on  the  manufacturability  of  product  designs.  As  part  of  this  research  program,  the 
author  met  with  engineers  from  Cognition  Corporation  and  LMTAS  [17]  in  order  to  better  understand 
their  development  efforts. 

However,  in  order  to  have  a  complete  virtual  manufacturing  environment,  the  cost  estimating 
capability  must  be  extended  beyond  this  process-level  analysis  to  the  system  level  and  ultimately  to  the 
enterprise  level.  The  following  two  sections  describe  these  levels. 
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Figure  6:  Combining  elements  into  process  operations. 
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Transforming  process  operations  into  manufacturing  process  objects 

The  third  activity  moves  the  modeling  and  analysis  capability  from  the  process  level  to  the 

system  level  by  transforming  discrete  process  operations  into  system-level  manufacturing  process 
objects.  It  is  at  this  level,  and  through  the  use  of  the  objects,  that  manufacturing  process  operations  are 
placed  in  the  context  of  an  actual  production  environment  and  the  product  is  integrated  into  the  fabric 
of  a  manufacturing  environment.  As  a  result,  the  objects  become  the  basis  for  the  design  of  production 
operations  and  facilities  by  considering  such  things  as  product  movement,  storage  and  handling  issues, 
resource  availability,  interference  with  other  products,  grouping  of  elements  into  cells,  etc.  The 
combining  of  process  operations  into  objects  is  analogous  to  implementing  process  plans  into  the 
production  environment.  The  application  of  process  objects  is  illustrated  by  the  facility  layout  in  the 
middle  of  Figure  7.  In  this  case,  continuing  with  the  widget  example,  the  strap  item  is  created  in  the 
press  area;  the  slab  is  created  by  slitting,  is  then  painted,  and  fabrication  is  completed  in  the  cell  prior  to 
assembly  with  the  other  items. 

The  symbols  at  the  bottom  of  Figure  7  illustrate  the  differences  between  process  operations  and 
process  objects.  Process  operations  consider  both  the  procedures  (manufacturing  steps)  and  the  human 
and  equipment  resources  that  are  required  to  perform  the  specific  manufacturing  activities.  For 
example,  the  stamping  process  operation  identifies  the  type  of  machine  and  operator  that  are  needed  to 
do  the  stamping,  as  well  as  the  time  required  per  item.  As  shown  in  the  process  object  symbol  in 
Figure  7,  objects  also  consider  procedures  and  resources  but  in  addition  they  consider  movement  and 
storage  of  the  product  between  and  within  objects,  machine  and  operator  capacities,  programmatics, 
interaction  with  other  products,  and  interaction  among  resources.  Also,  process  operations  are  static, 
whereas,  process  objects  are  dynamic  and  stochastic  -  both  of  which  are  characteristics  of  most 
manufacturing  systems.  That  is,  process  objects  are  time  sensitive  and  have  the  ability  to  incorporate 
random  events.  Process  objects  obtain  most  of  their  power  from  imbedded  methods,  such  as  scheduling 
algorithms,  simulation,  etc. 

It  is  only  through  manufacturing  process  objects  and  the  interaction  among  objects  that 
manufacturing  performance,  including  costs,  can  be  evaluated.  The  object  provides  output  regarding 
both  the  expected  value  and  distribution  of  such  measures  as  completion  times,  throughput,  operator 
and  machine  utilization,  work-in-process  inventory,  space  requirements,  etc.  System-level  analysis 
provides  a  means  to  identify  bottlenecks  and  other  potential  problems  before  the  product  is  committed 
to  production.  It  is  only  at  this  level  that  the  impact  of  a  product's  characteristics  on  production  process 
can  be  assessed  and  evaluated.  This  is  especially  important  in  an  agile  environment  or  for 
accommodating  small  batches  of  dissimilar  products  (e.g.  items  for  the  defense  industry  be  produced  on 
commercial  lines).  In  fact,  the  design  of  the  product  may  change  based  on  the  current  state  of  the 
manufacturing  facility,  e.g.  available  resources,  supplier  constraints,  etc. 


21-  13 


Figure  7:  Transforming  operations  into  manufacturing  process  objects. 


The  manufacturing  process  object  is  the  next  step  beyond  the  current  state  of  the  art  in  the 
development  of  a  virtual  manufacturing  environment  that  directly  links  product  features  to  enterprise- 
level  analyses.  Therefore,  a  more  detailed  examination  of  the  process  object  is  provided  in  Figure  8. 
Arrivals  to  an  object  may  be  either  individual  items  or  batches,  scheduled  or  random,  and  either 
originate  external  to  the  system  being  modeled  or  from  another  manufacturing  process  object.  Process 
objects  consider  moves  and  storage  to/ from  the  object  and  within  the  object.  In  the  example  in  Figure  8, 
an  arrival  requires  a  machine  and  an  operator  and  may  have  to  wait  for  either  resource  or  for  the 
appropriate  tooling,  additional  materials,  etc.  The  object  typically  includes  rules  for  selecting  resources 
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or  for  the  resources  to  select  products.  Activity  times,  both  machine  and  operator,  are  stochastic  and 
sampled  from  probability  distributions.  Also,  resources  used  by  objects  are  not  always  available,  due  to 
scheduled  or  unscheduled  maintenance,  shift  schedules,  etc.  Objects  also  include  the  modeling  of  the 
disposition  of  scrap  and  re-work.  It  is  only  through  the  dynamic  and  stochastic  capabilities  of  the 
manufacturing  process  object  that  a  system's  performance  can  realistically  be  evaluated  and  virtual 
manufacturing  can  truly  be  achieved. 


sched. 

unsched. 


Tapir 

•,b*teh 


Figure  8:  Example  of  a  manufacturing  process  object. 


Linking  manufacturing  process  objects  to  enterprise  processes 

The  highest  level  of  cost  analysis  is  at  the  enterprise  level.  Enterprise-level  costing  extends 
design  sensitivity  beyond  direct  manufacturing  costs  to  include  manufacturing  and  non-manufacturing 
indirect  costs.  The  impact  of  changes  in  product/ process  design  on  indirect  entities  in  the  enterprise  is 
assessed,  and  conversely,  the  impact  of  changes  in  indirect  entities  on  manufacturing  processes,  and 
hence  on  the  product,  is  also  assessed.  For  example,  specification  of  a  new  material  during  product 
design  will  not  only  affect  operations  in  terms  of  the  type  of  equipment  used,  labor  skills,  tooling,  scrap 
rate,  etc.  but  will  also  affect  numerous  indirect  entities  in  the  organization,  including  training, 
purchasing,  warehousing,  etc.  Normally  these  effects  are  assumed  to  be  captured  in  the  application  of 
very  general  overhead  rates.  However,  enterprise-level  costing  permits  a  more  realistic  assessment  of 
the  impacts  of  product  and  process  alternatives.  In  order  to  realize  enterprise-level  cost  estimation,  the 
firm's  business  processes  must  be  identified,  modeled,  and  linked  to  the  appropriate  manufacturing 
process  object(s).  This  is  illustrated  in  Figure  9  with  the  links  between  various  functional  entities  in  the 
organization.  Also,  as  illustrated  in  Figure  9,  enterprise  is  defined  in  its  broadest  context  and  indudes  all 
non-manufacturing  entities  in  the  company  (administration,  purchasing,  etc.),  as  well  as  external  entities 
such  as  customers  and  suppliers. 
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Figure  9:  Enterprise-level  analysis  links  indirect  entities  with  direct  manufacturing  objects. 


An  example  of  a  model  of  a  business  process  is  provided  in  Figure  10;  the  supplier  management 
and  procurement  process  was  developed  as  part  of  the  Manufacturing  2005  Program  [13].  The  process 
model  illustrates  the  linkages  between  manufacturing  and  such  entities  as  program  management, 
engineering,  quality  assurance,  the  supplier,  etc. 


As  mentioned  above,  enterprise-level  modeling  and  analysis  results  in  a  better  understanding 
and  a  better  definition  of  the  behavior  of  overhead  costs.  Overhead  costs  are  a  significant  component  of 
total  product  cost.  Raffi  and  Swamidass  [18]  report  that  in  an  average  U.S.  manufacturing  firm, 
manufacturing  overhead  cost  is  about  two  and  one  half  times  the  direct  labor  cost.  Thomas  [11]  reports 
indirect  costs  to  be  400  to  600  percent  of  direct  costs  in  aerospace  defense  production.  Despite  the 
significance  of  overhead  costs,  little  published  research  exists  in  this  area.  The  MODDCE  framework 
provides  a  foundation  for  modeling  non-manufacturing  processes  and  quantifying  the  effect  of  a 
process's  behavior  on  overhead  cost.  New  accounting  methodologies,  such  as  activity-based  costing 
(ABC),  will  be  required  in  order  to  model  and  link  direct  and  indirect  processes. 

As  part  of  this  research  project,  the  author  visited  the  Northrop  Grumman  Commercial  Aircraft 
Division  (NG-CAD)  in  Dallas,  TX  in  order  to  gain  a  better  understanding  of  the  costing  and  risk 
assessment  methodologies  used  in  the  Manufacturing  20005  Program,  Military  Products  using  best 
Commercial/ Military  Practices  (MP-C/MP)  [19]. 

Development  of  an  Application  of  the  MODDCE  Framework 

This  research  project  provides  a  framework  for  manufacturing-oriented,  design-driven  cost 
estimating.  Subsequent  research  will  develop  an  application  to  implement  the  framework.  This  section 
provides  a  brief  and  preliminary  definition  of  the  application. 

As  shown  in  Figure  11,  a  decision-support  application  that  implements  the  MODDCE 
framework  would  include,  at  a  minimum,  the  seamless  integration  of:  relational  and/ or  object-oriented 
database(s),  a  mechanical  CAD  package  for  product  design  (e.g.  Cognition  Corporation's  Mechanical 
Advantage,  Parametric  Technology  Corporation's  Pro/E),  process  and  cost  modeling  system(s)  (e.g. 
Cognition  Corporation's  Cost  Advantage),  manufacturing  process  objects  that  would  most  likely  be 
developed  from  manufacturing  and/or  object-oriented  simulators  (e.g.  AESOP  s  SIATPLE++,  ProModel 
Corporation's  ProModel),  and  business  modeling  and  simulation  software  (e.g.  CACI's  SimProcess, 
ProModel  Corporation's  ProcessModel,  AdvanEdge  Technologies  Corporation's  OPTIMA!).  These 
components  would  be  integrated  using  software  such  as  Microsoft  Visual  Basic  and/ or  Knowledge  Base 
Engineering's  IKE  ( Integrated  Knowledge  Environment).  The  application  would  be  developed  in  a 
Microsoft  Windows  environment,  most  likely  Windows  NT. 

Development  of  manufacturing  process  objects,  development  of  the  underlying  database 
structure,  modeling  of  business  processes,  integration  of  the  technologies  outlined  above,  and  design  of 
the  user  interface  and  decision-support  environment  would  be  required  to  effectively  link  product 
features  and  characteristics  to  manufacturing  and  enterprise-wide  processes.  The  application  would 
support  the  modeling  of  information  flow  as  well  as  product  and  material  flow.  Costing  methodologies, 
e.g.  activity-based  costing,  need  to  be  investigated  in  order  to  determine  their  applicability  to  this 
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environment  and  identify  the  best  way  to  integrate  the  costing  methodologies  into  the 
framework/  application. 


Figure  11:  Application  software  for  implementing  the  framework. 


FUTURE  RESEARCH  ISSUES 

As  defined  in  the  previous  section,  the  next  research  activity  is  to  begin  the  development  of  a 
prototypical  application  that  demonstrates  the  proposed  framework.  This  completed  prototype  will 
provide  a  laboratory  for  exploring  a  variety  of  issues  related  to  cost  estimating  in  an  integrated 
product/ process  development  environment  and  virtual  design  and  manufacturing  environments.  Some 
of  those  issues  include  specification  of  the  most  appropriate  costing  methodologies  for  assessing  both 
direct  and  indirect  costs,  the  application  of  activity-based  costing,  application  of  machine  learning, 
development  of  approaches  for  optimization  of  product/ process  design  via  simulation,  etc. 

Another  area  for  further  research,  which  would  directly  support  the  application  development 
defined  above,  is  the  generalization  and  extension  of  the  "innovative"  costing  work  performed  in 
support  of  the  Manufacturing  2005,  MP-C/MP  (Military  Products  Using  Best  Commercial/Military 
Practices),  Program  ([13],  [14]).  One  of  the  outcomes  of  the  MP-C/MP  Program  is  a  detailed  assessment 
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of  the  impact  of  numerous  business  practice,  manufacturing  infrastructure,  and  process  technology 
improvement  initiatives  on  product  and  process  costs.  The  program  clearly  demonstrates  that  indirect 
cost  categories  can  be  linked  to  product  and  process  characteristics.  The  MP-C/MP  Program  cost 
methodologies  provide  a  basis  for  process  modeling  and  linking  manufacturing  and  non-manufacturing 
activities. 

A  third  research  project  would  involve  the  development  of  an  industry  survey  or  workshop 
that  would  determine  and  define  the  cost  estimating  needs  for  a  virtual  design  environment.  The  project 
would  solicit  ideas  on  how  cost  estimating  should  be  performed  in  such  an  environment. 


CONCLUSIONS 

Realization  of  the  proposed  framework  would  advance  the  state  of  the  art  of  manufacturing  cost 
estimating  methodologies  in  the  following  ways:  (1)  Product  design  is  directly  linked  to  process  design 
and  manufacturing  operations,  thereby  achieving  "true"  virtual  manufacturing.  Shop-floor  and  above- 
the-shop-floor  assessments  are  performed  in  the  computer  before  commitments  are  made  to  produce  a 
product;  e.g.  potential  bottlenecks  can  be  identified  and  resolved  before  they  occur.  (2)  Product  and 
process  design  are  directly  linked  to  non-manufacturing  activities  in  the  enterprise,  resulting  in  a  more 
accurate  and  more  meaningful  assessment  of  indirect  costs.  (3)  Product/ process  designs  are  assessed 
and  "optimized"  at  the  system  level,  rather  than  sub-optimized  by  functional  area.  (4)  The  framework  is 
applicable  throughout  the  product  and  process  life  cycles,  thereby  providing  traceability  of  product 
design,  manufacturing  design,  and  cost.  It  also  provides  a  growing  knowledge  base  of  past  practices  and 
lessons  learned  to  be  used  in  future  designs.  (5)  The  aforementioned  knowledge  base  can  be  used  not 
only  for  human  learning,  but  as  a  basis  for  machine  learning  -  a  means  to  analyze  and  "optimize"  the 
relationships  between  product  features  and  the  processes  required  to  produce  those  features. 
(6)  Product/process  trades  are  made  on  "current"  planned  activities.  This  is  especially  important  as  the 
design  cycle  time  shrinks  -  current  activities  and  plans  for  the  facility  may  impact  the  selection  of 
product/process  design  alternatives. 
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Abstract 


This  research  was  sponsored  by  the  Air  Force  Office  of  Scientific  Research, 

Bolling  AFB,  DC  and  the  Air  Force  Wright  Laboratory  Air  Base  Technology  Branch 
(WL/FIVC-OL  Tyndall  AFB,  FL)  as  part  of  the  Summer  Faculty  Research  Program. 

This  research  was  designed  to  assist  Wright  Laboratory  Air  Base  Technology  Branch  in 
identifying  emerging  technologies  for  the  development  of  a  versatile,  air  mobile,  and 
environmentally  safe  waste  processing  system  to  support  force  deployment  operations.  In 
addition,  a  specific  thrust  of  this  researcher  is  to  develop  cost/benefit  methodologies  and 
to  incorporate  in  the  alternative  development  processes  those  aspects  of  engineering, 
socio-economic,  and  policy  issues  that  could  impact  the  cost  effectiveness  of  alternative 
solutions.  This  final  report  discusses  some  of  the  issues  that  International  Quality 
Standards  could  have  on  the  design  and  cost  effectiveness  of  Bare  Base  Waste  Disposal 
Alternatives.  Although  these  standards  are  still  under  development  and  there  are  not  yet 
definitive  policies  (14,  15)  on  how  the  DoD  will  integrate  these  standards  into  their  bare 
base  deployments,  a  DoD  wide  Environmental  Management  Systems  Committee  has  been 
formed.  (14,  15)  They  will  continue  to  evaluate  the  development  of  international 
standards  to  determine  if  they  present  an  “opportunity  for  DoD  to  improve  its 
environmental  management,”  with  a  “primary  motivation  for  (sic)  regulatory  relief  and 
reduced  costs.”  (14)  This  researcher  believes  these  standards  will  require  us  to  process 
waste  more  efficiently  and  environmentally  sound,  thereby  provide  solid  ground  to  support 
technology  innovation  under  development  and  proposed  by  Wright  Laboratories. 
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AFFECTS  OF  INTERNATIONAL  QUALITY  STANDARDS  ON  BARE  BASE 
WASTE  DISPOSAL  ALTERNATIVES 

Rita  A.  Gregory 

Introduction 

This  research  was  sponsored  by  the  Air  Force  Office  of  Scientific  Research, 

Bolling  AFB,  DC  and  the  Air  Force  Wright  Laboratory  Air  Base  Technology  Branch 
(WL/FIVC-OL  Tyndall  AFB,  FL)  as  part  of  the  Summer  Faculty  Research  Program. 

This  research  was  designed  to  assist  Wright  Laboratory  Air  Base  Technology  Branch  in 
identifying  emerging  technologies  for  the  development  of  a  versatile,  air  mobile,  and 
environmentally  safe  waste  processing  system  to  support  force  deployment  operation.  In 
addition,  a  specific  thrust  of  this  researcher  is  to  develop  cost/benefit  methodologies  and 
to  incorporate  in  the  alternative  development  processes  those  aspects  of  engineering, 
socio-economic,  and  policy  issues  that  could  impact  the  cost  effectiveness  of  alternative 
solutions.  This  final  report  for  the  Summer  Faculty  Research  Program  discusses  some  of 
the  issues  that  International  Quality  Standards-specifically  ISO  9000,  international 
standards  for  manufacturing  and  construction  and  ISO  14000,  international  standards  for 
environmental  management  systems  (1)— could  have  on  the  design  and  cost  effectiveness 
of  Bare  Base  Waste  Disposal  Alternatives.  Although  these  standards  are  still  under 
development  and  there  are  not  yet  definitive  policies  (14,  15)  on  how  the  DoD  will 
integrate  these  standards  into  their  bare  base  deployments,  a  DoD  wide  Environmental 
Management  Systems  Committee  has  been  formed.  (14,  15)  Members  of  this  committee 
believe  “ISO  14000  will  have  greater  impact  on  international  trade  than  the  ISO  9000” 
and  “ISO  14000  will  grow  in  importance  over  time.”  (14)  They  will  continue  to  evaluate 
the  development  of  international  standards  to  determine  if  they  present  an  “opportunity  for 
DoD  to  improve  its  environmental  management,”  with  a  “primary  motivation  for  (sic) 
regulatory  relief  and  reduced  costs.”  (14) 

The  U.S.  Army  Corps  of  Engineers  (COE)  say  the  quality  (management)  system 
outlined  under  ISO  9002  is  the  standard  most  applicable  to  construction  firms  in  the 
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private  sector  that  support  military  construction.  (16)  The  Louisville  (KY)  District  is 
attempting  registration  under  ISO  9000  and  is  tracking  the  development  of  ISO  14000. 
They  say  “Registration  to  the  ISO  9000  standard  demonstrates  to  our  customers  our 
commitment  to  quality  and  our  ability  to  meet  their  specified  requirements  and  to  possibly 
exceed  their  service  expectations.  .  .  .  ISO  9001  registration  is  a  practical,  commercial 
requirement  for  world-wide  business  organizations.”  (17)  The  ISO  standards  in  the 
United  States  are  known  as  ANSI/ASQC  Q9000  series.  (17) 

International  Standards  are  the  way  of  the  future  for  home  bases  and  contingency 
deployments.  This  researcher  believes  these  standards  will  require  Air  Force  Bare  Bases 
to  process  waste  more  efficiently  and  environmentally  sound,  thereby  provide  solid  ground 
to  support  technology  innovation  under  development  and  proposed  by  Wright 
Laboratories. 

Background/Discussion  of  Problem 

In  the  future,  under  the  doctrine  of  “Global  Reach...  Global  Power,"  the  USAF  is 
preparing  to  fight  world  wide  from  1 100-person  Bare  Bases  consisting  of  only  operating 
surfaces  and  a  source  of  water.  All  other  base  infrastructure  components  must  be  rapidly 
brought  via  airlift.  The  USAF  tasked  Sverdrup  Technology,  Inc.,  TEAS  Group  with 
developing  the  “Bare  Base  Infrastructure  Road  Map,"  identifying  deficiencies  and  defining 
promising  technologies  available  in  1996  to  2015  for  improving  current  austere  base  utility 
and  facility  equipment.  Inadequate  Waste  Management  Systems  was  one  of  the  top  ten 
deficiencies  identified  in  this  study.  Several  of  the  top  priorities  for  immediate  action 
recommended  by  the  study  were  to  quantify  and  characterize  waste  streams  and 
investigate  alternatives  for  waste  disposal  at  Bare  Bases.  (2)  The  Office  of  Scientific 
Research  sponsored  research  under  the  Summer  Faculty  Research  Program  was 
conducted  with  three  major  aspects:  quantifying  the  waste  stream,  literature  search  on 
emerging  technologies  and  commercial-off-the-shelf  (COTS)  for  waste  remediation,  and 
identifying  key  policies  or  elements  affecting  the  deployability  and/or  cost  effectiveness  of 
these  technologies.  Quantifying  and  characterizing  the  waste  stream  of  an  1 100-person 
deployed  unit  was  not  possible  because  there  is  no  such  exact  team  currently  deployed. 
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The  research  team  investigated  several  alternative  sources  for  corollary  data,  such  as  the 
Silver  Flag  training  site  at  Tyndall  AFB,  FL,  the  RED  HORSE  Squadron  at  Hurlbert  AFB 
and  their  data  from  a  recent  Bosnia  deployment,  and  the  Logistics  Civil  Augmentation 
Program  (LOGCAP)  data  form  the  Corps  of  Engineers  at  Winchester,  VA.  This  data 
search  is  described  by  the  Graduate  Student  Research  Final  Report,  base  page  number  16. 
The  technology  matrix,  with  accompanying  discussions  and  brain-storming  on 
employment  and  deployment  options,  was  provided  under  separate  cover  to  the  WL/FIVC 
project  officer.  The  final  aspect,  policies  and  or  elements  affecting  the  deployability 
and/or  cost  effectiveness  of  the  technologies,  focused  on  the  Global  emphasis  on 
international  specification  standards  and  developing  laws  that  can  restrict  operating 
locations  and  limit  equipment  usage  in  deployed  locations.  This  final  report  discusses  this 
third  aspect  of  this  summer  research. 

Methodology 

The  methodology  consisted  of  literature  research  on  International  Organization  for 
Standardization  (ISO),  emerging  laws  or  international  policies,  and  company/industry 
experiences  in  accepting  or  adapting  to  the  emerging  trend  for  international  standards.  In 
addition,  telephone  interviews  and/or  discussions  were  held  with  various  levels  of  the 
USAF  and  DoD  policy  makers  and  analysts  (15)  to  ascertain  an  impact  or  strategy  for 
incorporating  these  international  standards  in  design  or  costing  of  alternative  technologies. 
There  was  no  final,  published  DoD  or  USAF  policy  available  for  reference. 

Results 

The  International  Organization  for  Standardization  (ISO)  is  a  worldwide  federation, 
established  in  1947,  of  1 1 1  member  countries  represented  mainly  by  industry  and 
governmental  standards  groups.  (1)  Its  mission  “is  to  promote  the  development  of 
standardization  and  related  activities  in  the  world  with  a  view  to  facilitating  the 
international  exchange  of  goods  and  services,  and  to  developing  cooperation  in  the 
spheres  of  intellectual,  scientific,  technological,  and  economic  activity.  (ht.//www.iso.ch)” 
(1)  The  International  Organization  for  Standardization  (ISO)  defines  standards  as 
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“documented  agreements  containing  technical  specifications  or  other  precise  criteria  to  be 
used  consistently  as  rules,  guidelines,  or  definitions  of  characteristics,  to  ensure  that 
materials,  products,  processes,  and  services  are  fit  for  their  purpose.”  These  international 
agreements  are,  in  turn,  published  as  international  standards  or  incorporated  into  national 
standards.  (1)  International  concern  on  abuse  of  the  environment  is  a  rapidly  growing 
trend. 

Equally  important  in  setting  international  standards  is  the  European  Union  (EU). 
The  EU  population  of  375  million  is  made  up  of  Austria,  Belgium,  Denmark,  Finland, 
France,  Germany,  Greece,  Ireland,  Italy,  Luxembourg,  the  Netherlands,  Portugal,  Spain, 
Sweden,  and  the  United  Kingdom  (U.K.).  In  addition  to  the  EU,  companies  should 
anticipate  international  standards  certification  requirements  from  Norway,  “seven  nations 
in  Eastern  Europe,  three  Baltic  nations,  and  two  Mediterranean  nations  that  are 
negotiating  to  enter  the  EU  during  the  next  ten  years.”  (8)  "This  will  increase  the  EU 
population  to  500  million  with  an  additional  400  million  population  in  EU  Associate 
Member  States  in  North,  West,  and  South  Africa  and  the  Caribbean.”  (8) 

Bare  Base  deployment  is  inherently  international,  requires  international  or 
intercountry  agreements  (13),  and  will  be  affected  by  the  movement  on  global 
sustainability,  defined  by  the  Brundtlandt  Commission  as  “meeting  the  present  human 
needs  without  compromising  the  ability  for  future  generations  to  meet  their  own”  (1). 

ISO  9000,  international  standards  for  manufacturing  and  construction  and  ISO  14000, 
international  standards  for  environmental  management  systems  (1)  are  receiving  the  most 
visibility.  However,  other  movements  such  as  the  European  Union  Directive  on 
Packaging  and  Packaging  Waste  (11),  and  the  “1996  German  Takeback  Legislation”  (1) 
and  the  German  Topher  Law  (1)  will  have  major  impacts  on  deployment  equipment  and 
locations  for  Bare  Bases.  These  and  other  emerging  standards  and/or  legislation  will  be 
discussed  relative  to  the  impact  on  Bare  Base  deployments. 

ISO  9000,  the  international  standards  for  manufacturing  and  construction,  is 
“something  that’s  becoming  necessary  to  do  business  around  the  world.”  (1)  Many  of  the 
industries  that  provide  equipment  for  DoD  and  companies  that  support  through  service 
contracts  are  actively  seeking  certification  of  compliance  with  ISO  9000.  (12)  “They 
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realize  (sic)  that  compliance  is  (sic)  voluntary... but  ignoring  these  requirements  could 
prove  hazardous  to  their  marketshare.”  (3)  Defense  Logistics  Agency  is  examining  how 
ISO  14000  (and  presumably  9000)  will  work  in  its  business  environment,  since  85%  of 
their  purchases  are  from  small  businesses.  (14)  Under  current  certification  processes  small 
businesses  will  be  at  a  disadvantage.  One  reference  noted  for  a  small  business,  it  took  one 
year  to  achieve  certification”  (3).  The  Louisville  District  Corps  of  Engineers  anticipates 
12  to  18  months  to  achieve  certification.  (17)  European  Union  (EU)  is  now  legally 
requiring  CE  marking  on  all  products  sold,  imported,  or  used  in  their  countries.  (5,  8) 

“CE  is  the  French  abbreviation  for  Conformite  Europeene,  which  means  European 
Conformity  in  English."  (5)  A  mandatory  examination  and  certification  by  a  third  party 
officially  recognized  by  an  EU  Member  State  is  required  prior  to  affixing  the  CE  mark.  If 
a  product  is  not  marked  as  certified,  it  is  considered  an  illegal  product.  Depending  on  the 
host  nation  law,  it  can  be  sent  back,  detained,  confiscated,  fined,  or  forced  into 
compliance.  In  one  case,  it  cost  a  company  $48,000  to  reengineer  their  product  and 
resulted  in  six  months  delay  to  the  customer.  (9)  Nonconformance  penalties  are  set  by 
each  EU  Member  State  and  range  from  civil  fines  of  $62,500  per  infraction  in  Germany  to 
$5,000  and  three  months  in  prison  in  the  U.K.  (5,  9)  Potential  of  these  fines  must  be 
included  in  the  cost  of  doing  business  and  the  cost  effectiveness  of  alternatives,  but  more 
importantly  the  potential  of  deploying  forces  being  prohibited  from  entry  to  specific 
locations  must  be  a  major  element  in  choosing  waste  disposal  systems.  Under  the  USAF 
doctrine  of  Global  Reach-Global  Power  and  Bare  Base  concepts,  at  no  time  can  we  allow 
a  waste  disposal  system  be  the  limiting  element  for  deployable  locations. 

More  specific  standards,  identified  in  the  EU  Declaration  of  Conformity,  such  as 
the  EU  Electromagnetic  Compatibility  Directive  89/336/EEC  (6)  and  the  EU  Low 
Voltage  Directive  73/23/EEC  (7),  may  affect  specific  alternatives  such  as  plasma  torch 
technologies.  In  these  directives  “the  manufacturer  (of  all  parts  and  components)  must 
take  all  measures  necessary  in  order  that  the  manufacturing  process  shall  ensure 
compliance  of  the  products  with  the  technical  documentation  and  the  requirements  of  this 
directive.”  (7) 
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In  addition  to  compliance,  the  EU  requires  all  manufacturers  to  maintain  a 
technical  file  that  contains  technical  documentation  to  demonstrate  the  conformity  of  a 
product  with  the  essential  safety  requirements  of  a  directive  or  standard.  (10)  Under 
some  directives  the  a  manufacturer  can  “self-declare”  conformance  while  other  require 
third  party  certification.  The  Air  Force  Center  for  Environmental  Excellence  is 
considering  seeking  to  become  a  third  party  certified  for  ISO  14000.  The  Air  Force  is  not 
aware  of  any  private  companies  in  the  U.S.  that  are  currently  qualified  to  be  certifiers.  (14) 
Reducing  the  solid  waste  stream  for  Bare  Base  is  inherently  a  better  alternative  to 
disposal.  In  characterizing  the  solid  waste  stream  for  Bare  Base  (refer  to  the  Graduate 
Student  Research  Final  Report,  base  page  number  16  of  this  report),  operational  forces 
repeatedly  identified  packaging  and  shipping  products  as  a  major  source  of  the  waste 
stream.  The  EU  Directive  on  Packaging  and  Packaging  Waste  94/62/EC  (modeled  after 
the  German  packaging  ordinance- VERPACKUNGSVERORDUNG)  “covers  all 
packaging  and  all  packaging  waste  whether  it  is  used  or  released  at  the  industrial, 
commercial,  office,  shop,  service,  household,  or  any  other  level,  regardless  of  the  material 
used.”  (11)  “Packaging  means  all  products  made  of  any  materials  of  any  nature  to  be  used 
for  the  containment,  protection,  handling,  delivery,  and  presentation  of  goods,  from  raw 
materials  to  processed  goods,  from  the  producer  to  the  user  or  the  consumer.”  (11)  This 
directive  established  basic  targets,  quotas,  and  limitations,  that  are  reviewed  periodically 
for  upgrade,  for  the  member  states  and  people  operating  in  those  states.  EU  operates  on 
the  “polluter  pays”  principle.  “The  polluter  pays  principle  means  that  the  responsibility  for 
recycling  and  recovery  falls  upon  the  packager,  not  the  end  user.”  (11)  Similarly  the 
controversial  “1996  German  Takeback  Legislation”  requires  the  producer  to  “take  back” 
its  product  after  it  has  served  its  purpose— the  end  product  after  the  customer  no  longer 
uses  it.  This  law  implies  redesign  of  the  full  life-cycle  of  product  development,  plant  lay¬ 
out,  processes,  usage,  and  disposal.  These  laws  and  directives  will  make  obsolete  the 
current  Bare  Base  waste  management  alternative  of  using  host  nation  landfills-thus 
supporting  new  technology  alternatives  that  are  more  environmentally  friendly. 
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Impact  on  Wright  Laboratory  Technology  Thrusts  for  Bare  Base  Waste  Processing 

Air  Force  Civil  Engineers  directly  support  Global  Reach-Global  Power  and  mission 
readiness  by  providing  the  basing  systems  to  protect,  project,  and  support  air  and  space 
systems  and  forces.  (18)  In  the  stated  vision,  the  Air  Force  will  maintain  forward 
presence  through  contingency  deployment  and  will  provide  engineering  support  to 
domestic  defense  and  natural  disasters.  Operations  will  be  conducted  in  host  nations 
and/or  at  civilian  airports.  (18)  Future  projections  anticipate  up  to  90%  of  Civil 
Engineering  capabilities  will  be  met  with  commercial  off-the-shelf  technologies.  (18) 
Coordination/collaboration  of  research  and  development  efforts  is  accomplished  through 
teaming  arrangements  with  industry,  academia,  other  federal  agencies  and  foreign 
governments.  (18)  These  doctrine  and  issues  underscore  that  International  Standards  will 
affect  the  processes  of  development  and  the  products  of  development  from  Wright 
Laboratory. 

The  Next  Generation  Bare  Base  Waste  Processing  System  (18)  research  and 
development  project  is  an  example  of  emerging  technology  that  can  fulfill  both  the  fact 
and  spirit  of  International  Quality  Standards.  This  project  is  studying  the  deployable  bare 
base  waste  system  for  all  potential  waste  streams  with  the  goal  of  reducing  waste  volume, 
stabilizing  pollutant  and  disease  vectors,  and  recovering  heat  and  decomposition  gasses  as 
alternative  fuel  sources.  Plasma  vitrification  technology  uses  only  15%  of  air  required  of 
conventional  incinerators  and  produces  no  harmful  particulates,  nor  NOx/SOx  emissions. 
Exhausted  heat  from  plasma  vitrification  process  can  be  recovered  for  electricity 
production  or  prime  heat  adsorbtion  air  conditioning.  The  payoff  of  this  research  will  be 
improved  mission  readiness,  enhanced  environmental  and  health  benefits,  and  reduced 
impact  on  the  host  nation  landfill/site  cleanup.  (18) 

Conclusions 

The  growing  trend  in  international  standards  for  environmental  management  and 
manufacturing/construction  will  be  a  driving  factor  in  the  selection  of  methods  to  dispose 
of  Bare  Base  waste  streams.  Current  methods,  such  as  use  of  host  nation  land  fills,  will  be 


22-9 


prohibited  and/or  will  become  excessively  expensive  to  conform  with  international 
standard  environmental  requirements.  International  standards  will  have  an  impact  on  the 
cost  effectiveness  of  alternatives  of  Bare  Base  waste  disposal.  More  importantly,  from  an 
operational  perspective,  enforcement  of  international  standards  could  limit  which  countries 
are  willing  to  accept  U.S.  Bare  Base  forces,  unless  all  our  systems  conform  with  their 
compliance  requirements.  These  trends  will  become  the  ground  work  for  technology 
innovation  for  systems  that  are  more  efficient  and  environmentally  sound. 
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Abstract 


The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of 
the  product  of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy 
associated  with  these  stresses  is  enormously  large  also  and  it  tries  to  relax  via 
one  of  several  possible  mechanisms.  Hence,  there  appears  a  driving  force  for  the 
rearrangement  of  the  material  elements  leading  to  the  evolution  accompanied  by 
the  creation  of  dislocations,  migration  of  vacancies,  development  of  interfacial 
corrugations,  etc... 

The  goal  of  this  research  project  is  two-fold:  i)to  develop  a  theoretical  ap¬ 
proach  which  permits  a  self-consistence  calculations  of  misfit  stresses  in  TiAl 
lamellar  multilayers  and  like  structures,  and  ii)to  develop  a  thermodynamic  ap¬ 
proach  allowing  one  to  study  the  possibility  of  stress  relaxation  by  means  of  mass 
rearrangement  in  the  vicinity  of  the  interface  between  the  lamellae.  We  assume 
that  the  rearrangement  is  accompanied  by  interfacial  diffusion  and  migration  of 
the  vacancies.  Since  the  full  consideration  of  the  coherent  interface  solid-solid 
includes  lots  of  calculation  and  details  we  illustrate  our  results  and  approach  by 
considering  the  much  simpler  but  conceptually  a  very  close  problem  of  mass  re¬ 
arrangement  in  the  vicinity  of  a  stress-free  interface. 
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MISMATCH  STRESSES,  LAMELLAR  MICROSTRUCTURE  AND 
MECHANICAL  PROPERTIES  OF  TiAI- ALLOYS 
Michael  Grinfeld 


1.  Introduction 

According  to  the  publications  of  Dimiduk  et  al  [3],  Hazzledine  et  al  [8]  and  others, 
the  Ti-Al  alloys  improve  almost  all  mechanical  properties  when  they  are  in  fully 
lamellar  microstructural  form.  This  is  why  this  form  of  the  microstructure  attracts 
so  much  attention  of  the  practitioners  and  researcher  dealing  with  TiAl-alloys. 
Many  fundamental  features  of  this  form  of  TiAI  alloys  remain  unexplored  and 
unclear.  One  of  them  is  the  magnitude  of  mismatched  stresses  in  the  lamellae, 
the  mechanism  of  mismatch  stresses  relaxation  and  the  influence  of  mismatch 
stresses  on  different  mechanical  properties.  The  source  of  these  stresses  is  the 
lattice  mismatch  between  the  a-  and  7-lamellae,  and  also  between  the  7-lamellae 
with  different  orientation.  The  crystallography  and  mismatch  deformations  in  the 
TiAl-lamellae  are  explored  pretty  well  nowadays  by  experimentalists,  and  it  seems 
to  be  one  of  the  urgent  necessities  to  develop  the  theory  allowing  to  calculate  the 
mismatch  stresses  with  the  help  of  information  relating  to  the  crystallography 
and  the  mismatch  strains.  It  is  the  central  issue  of  this  project  to  find  the  elastic 
misfit  stresses  in  the  TiAl-laminae.  We  have  established  rather  simple  compact 
formulae  for  the  stresses  within  the  lamellae  (on  certain  distance  from  the  external 
boundaries  and  the  edges). 

The  misfit  stresses  are  of  the  order  of  elastic  moduli  times  misfit  deformations. 
Since  the  misfit  deformations  have  enormously  high  magnitude  (as  compared  with 
the  deformations  in  conventional  macroscopic  solids)  the  misfit  stresses  and  cor¬ 
responding  elastic  energy  are  both  enormously  large  and  they  should  be  released 
via  different  mechanisms  of  stress  relaxation  including  proliferation  of  misfit  dis¬ 
locations,  migration  of  vacancies  and  dislocations,  development  of  interfacial  cor¬ 
rugations,  etc...  When  working  of  this  project  we  have  developed  i)some  of  our 
early  results  on  the  stress-driven  rearrangement  instabilities  of  the  solid-solid  in¬ 
terfaces  to  make  them  applicable  to  the  Ti-Al  lamellae  and  also  ii)some  of  out 
results  relating  to  the  slow  evolution  of  interfaces  in  pre-stressed  solids.  Though 
our  developed  approach  do  not  face  any  conceptual  obstacles  for  the  applications 
to  TiAI  alloys  we  present  here  some  preliminary  results  only  on  the  both  issues  i) 
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and  ii)  because  of  the  necessity  of  further  routine  calculations  and  analysis  and 
because  of  volume  limitations  for  this  report. 


2.  Stresses  in  laminated  elastic  structure  with  misfit 

2.1.  The  notion  and  geometry  of  misfit  deformation 

In  this  Section  we  give  a  simplified  linear  description  of  elastic  heterogeneous  sys¬ 
tems  with  coherent  interfaces  and  misfit  strains  (see  Dimiduk  et  al  [3],  [4],  Haz- 
zledine  et  al  [8],  Kad  et  al  [10]  for  the  crystallographic  description  and  physical 
motivation,  and  Grinfeld  [6]  for  the  more  general  situation  and  a  more  detailed  de¬ 
scription  of  the  mechanics  of  heterogeneous  solids  with  finite  misfit  deformations). 
Let  us  choose  one  of  the  unstressed  configurations  of  the  crystalline  substance  to 
be  the  reference  basis  for  the  lamellae  elastic  structure.  In  the  reference  config¬ 
uration  we  choose  some  affine  coordinate  system  x the  Latin  indices  run  the 
values  1, 2, 3;  the  summation  convention  over  repeated  indexes  is  implied  in  what 
follows.  We  remind  the  reader  that  each  affine  coordinate  system  has  straight  co¬ 
ordinate  axes  which  are  not  necessarily  perpendicular  to  each  other  (in  particular, 
any  Cartesian  coordinate  system  is  necessarily  affine):  in  the  following  we  specify 
the  choice  of  a  specific  coordinate  system  which  is  the  most  convenient  for  the 
structure  in  question. 

We  denote  by  nl  the  components  of  a  unit  vector  which  is  perpendicular  to 
the  planes  along  which  the  lamellae  match  each  other.  The  total  displacement 
within  L-th  each  of  the  lamellae  Ui  (xk)  can  be  splitted  into  two  parts 

Ui  ( xk )  =  A ij  xj+  m  (xk)  (2.1) 

L  L  L 

where  the  first  term  A .y  xj  is  associated  with  the  misfit  deformations  whereas  the 

L 

second  Ui  ( xk )  is  associated  with  the  additional  elastic  deformations.  We  assume 

L 

that  the  misfit  strain  Ay  itself  does  not  generate  any  stresses  in  the  sense  that  un- 

L 

der  this  strain  the  isolated  lamellae  does  not  accumulate  any  elastic  energy.  The 
misfit  strain,  however,  is  the  indirect  source  of  huge  elastic  stresses  within  the  lam¬ 
inated  structure  since  the  strains  A y  differ  very  significantly  (as  compared  with 

L 

conventional  elastic  deformations)  within  different  lamellae.  That  difference  in  the 
misfit  strain  for  different  lamellae  triggers  paramount  elastic  strains  dj  u{  (xk)  and 
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stresses  ^because  of  the  requirement  of  coherency  across  the  interfaces  between 

L 

different  lamellae.  In  what  follows  the  coherency  is  implied  as  the  continuity  of 
the  total  displacement  across  the  interfaces.  Using  the  notation  [A]^+J  of  the 
jump  of  the  quantity  A  across  the  interface  separating  the  lamellae  numbered  as 
K  4-1  and  K  we  can  present  the  coherency  condition  as  follows: 

M  £+1  =  [a#**  +  l*'  =  0  (2.2) 

If  we  choose  the  coordinate  system  in  such  a  way  that  the  coordinate  planes 
x 1  —  x2  (i.e.,  the  plane  x 3  =  const  )  coincide  with  the  interfaces  between  the 
lamellae,  the  equation  2.2  appears  to  be  equivalent  to  the  following  one: 

A„  +  a,uj(x‘)]*+',5‘  =0  (2-3) 

Here  and  in  what  follows  the  Greek  indices  run  the  values  1 , 2  and  the  summation 
convention  over  repeated  Greek  indices  is  implied;  the  ”  shift”  symbol  8ra  is  equal 
to  1  if  i  =  a  and  it  is  equal  to  zero  otherwise. 

The  equality  of  the  forces  across  the  coherent  interface  can  be  written  as 

(2-4) 

The  assumption  of  the  linear  elasticity  gives  the  following  formula: 

Pij=  cijkl  di  uk  (2.5) 

L  L  l 

where  the  elasticity  tensors  are  symmetric  with  respect  to  the  permutation  of 
the  indices  c^kl—  c?lkl=  c*iy=  ...  Combining  2.4,  2.5,  we  arrive  at  the  following 

L  L  L 

equations: 

[c^kldlUk\KK+1  nj  =  0  (2.6) 

Within  the  bulk  of  each  lamellae  the  following  equilibrium  equation  should  be 
satisfied: 

cijkl  dkdt  uk=  0  (2.7) 

L  L 

The  system  of  equations  2.3,  2.6,  and  2.7  allows  to  find  equilibrium  displace¬ 
ment  within  each  of  the  lamellae  Uj  (xk)  provided  the  appropriate  mechanical 

L 

conditions  at  the  external  interface  is  specified. 


23-5 


The  equilibrium  displacements  depend  significantly  on  the  "misfit”  strains  A* j 

and  the  elasticity  tensors  ctjkl.  The  following  two  experimental  observations  allow 
to  simplify  the  system  of  equilibrium  equation  significantly: 

i) the  elasticities  c^kl are  the  same  within  each  of  the  lamellae;  in  the  following 

consideration  we  use  for  them  the  notation  c^kl . 

ii) only  the  in-plane  components  AQ/3  of  the  misfit  strain  A#  deviate  from  zero; 

hence  L  L 

Aij  =  Aa/3  Sftf,  Aia  =  0  (2.8) 

L  L  L 

2.2.  Some  features  of  the  stresses  in  laminated  structures  with  misfit 

The  TiAl  alloys  often  have  a  laminated  structure  with  the  laminae  having  the 
in-plane  dimensions  much  greater  as  compared  with  their  thickness.  Because  of 
this  very  feature  some  general  conclusions  regarding  the  equilibrium  stress-strain 
fields  can  be  made  in  advance  (these  conclusions  can  be  made  on  the  basis  of 
both:  the  experiment  and  relevant  asymptotic  analysis  of  the  elasticity  equations). 
The  equilibrium  fields  can  be  splitted  into  two  parts:  i)the  uniform  stress-strain 
fields  on  certain  distance  from  the  external  boundary  and  the  external  edges 
comparable  with  the  thicknesses  of  individual  lamellae  (in  the  following  we  name 
it  the  ” internal”  field),  and  ii)the  edge  distortions  of  the  above  ” internal”  field 
which  appear  to  be  very  significant  in  the  vicinity  of  the  external  interfaces  (in  the 
following  we  name  it  the  ” external”  field).  The  conditions  at  the  external  interface 
cannot  change  the  very  fact  of  the  uniform  character  of  the  "internal”  field  though 
they  can  change  the  specific  parameters  of  the  ’’internal”  field;  needless  to  say, 
that  the  conditions  at  the  external  surface  play  a  crucial  role  in  the  formation  of 
the  ’’external”  stress-strain  field.  Both  -  the  ’’external”  and  the  ’’internal”  fields 
can  play  significant  roles  in  different  aspects  of  the  behavior  of  the  lamellae-like 
alloys. 

2.3.  Uniform  misfit  stresses  of  the  ’’internal”  field 

2.3.1.  General  system  of  equations  defining  strain-stress  field 

The  equilibrium  ’’internal”  uniform  displacements  within  the  plates  ut  (xk)  can 
be  written  as  follows: 

ui  ( xk )  =  Kij  xj  (2.9) 

L  L 
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where  is  the  unknown  matrix  of  the  equilibrium  gradients.  Inserting  2.9  in  the 
equations  2.3  and  2.6  we  get,  respectively: 


(2.10) 


J\ 


=  0  (2.11) 

The  system  of  algebraic  equations  2.10-2.11  includes  less  equations,  than  the 
amount  of  the  unknowns  This  system  includes  only  the  demands  based  on 


L 

the  mechanical  equilibrium  and  coherency  at  the  interfaces.  To  make  the  system 
self-sufficient  certain  additional  physical  ideas  should  be  implemented.  A  deeper 
analysis  shows  that  there  are  several  "natural”  extensions  of  the  system  2.10- 
2.11.  Here  we  shall  discuss  only  two  extensions  which  are  of  interest  for  TiAl 
multilayers.  One  of  them  concerns  the  "misfit”  stresses  in  multilayers  grown  on 
a  thick  substrate,  the  other  -  the  ” misfit”  stresses  in  the  multilayer  which  are 
free  from  any  external  support.  In  the  former  case  the  substrate  appears  to  be 
unstressed  because  of  its  thickness:  we  refer  the  misfit  stresses  in  this  case  as 
the  ” on-substrate”  misfit  stresses.  We  counterpose  these  ”  on-substrate”  misfit 
stresses  to  the  "free”  misfit  stresses  related  to  the  latter  case. 

The  case  of  the  ’’free”  misfit  stresses  is  more  difficult  for  the  calculations  and 
it  will  be  studied  first.  For  this  very  case  and  periodically  repeated  stocks  of  the 
lamellae  the  appropriate  asymptotic  analysis  shows  that  for  the  following  equation 
should  be  satisfied  [17]: 


N  N 

E  Hpls=  E  He 

K  K  *-_i  K  K 


.ijkl 


K=l 


K= 1 


Kkl 

K 


(2,12) 


where  H  is  the  thickness  the  A-th  lamella  within  the  period. 

K 

The  6(.A—  1)  equations  2.10,  3(N  —  1)  equations  2.11,  and  9  2.12  form  a  closed 
system  of  equations  with  respect  to  9 N  unknowns  By  using  the  equation  2.10 

K 

we  can  find  3 N  unknowns  corresponding  to  the  skew-symmetric  (rotational)  part 
of  the  gradients  k^i\-  Calculation  gives  the  following  formulae: 

K 


K[ij] 


K+ 1 
K 


Ma/3] 


K+ 1 
J  K 


SfS^  +  2  ]«(,«,! 


1  A'+l 


J  K 


(2,13) 


The  equation  2.13  allows  to  calculate  the  equilibrium  rotations  k  within 

LM 

each  individual  lamellae  provided  the  equilibrium  deformations  k  have  already 

L(ka) 
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been  determined  from  some  other  equations  exhibited  below.  The  equation  2.13 
shows  that  i)the  equilibrium  rotation  within  the  laminae  are  actually  generated  by 
the  rotations  in  the  misfit  deformations  and  by  the  rotations  associated  with  the 
deformations,  and  ii)the  equilibrium  displacements  can  be  determined  to  within 
an  arbitrary  rotation  of  the  sample  as  the  whole. 

For  the  6Ar  unknowns  corresponding  to  the  symmetric  (deformation)  part  of 

the  gradients  k (ki)  we  find  the  following  6N  equations: 

K 


q/3 ) 


•  K+l 
.  K 


=  -  A 


(a/3) 


K+l 

K 


'cijklK(kl) 


iK+1 


K 


n i  =  0 


N 


£  »/“««=» 

KK  K 


K=1 


The  equation  2.10  gives  us,  in  particular 


K(aj)  nj  -  0 

L 


(2.14) 

(2.15) 

(2.16) 


(2.17) 


and.  hence,  the  formula  of  the  equilibrium  rotation  can  be  simplified  to  read 


K\ij] 


K+l 

K 


K+ 1 
K 


(2.18) 


According  to  the  equation  2.18  the  equilibrium  rotation  is  generated  by  the 
rotation  in  the  misfit  only,  and  not  by  the  equilibrium  deformations. 


2.4.  Misfit  stresses  in  the  ’’free”  7/7  plate-like  periodic  structure  with 
isotropic  moduli 

In  the  case  of  the  7/ 7  lamellar  structure  the  elastic  moduli  of  the  lamellae  occur 
to  be  equal,  and  the  master  system  2.15-2.16  can  be  simplified  significantly  to 
read: 


4,»rv°  (2-19) 

N 

E  Hk{u)  =  0  (2.20) 

K= 1  K  K 
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Using  the  equations  2.14,  2.19,  2.20  we  find  the  following  formula  of  the  equi¬ 
librium  stresses  Plj within  the  L-th  lamella  of  the  7/7  laminated  structure  for  the 

L 

case  of  isotropic  lamellae: 

Pij  =  £  T  H  (~^skl6ij  +  26ik6jl  -  ^nV)  ( A(kl)  -  A(fc0)  (2.21) 
L  H^k\  1-v  l-V  }  \  k  l  ) 

where  \x  is  the  shear  modulus  of  the  substance.  Simple  analysis  shows  that  for  the 
case  under  consideration  the  in-plane  stresses  P°^only  (a  and  (3  run  the  values  1 

it 

and  2)  can  assume  non-zero  values.  For  those  very  components  the  formula  2.21 
can  be  simplified  to  read 

Pa0  =  ^f^H  F*  +  2d a^s)  (aw  -  Aw)  (2.22) 

L  H  ^  K  \  l-V  )  V  K  L  ) 


2.5.  Misfit  stresses  in  the  ’’free”  0/7  -  periodic  structure  (different  elas¬ 
tic  moduli) 

2.5.1.  a/ 7  -  periodic  structure  (2-layers,  isotropic  elastic  lamellae) 

When  considering  the  case  of  different  elastic  moduli  we  use  the  general  system 
2.14  -  2.16.  First,  we  consider  an  analytical  solution  for  the  case  of  the  isotropic 
elastic  moduli  and  a  periodic  cell  consisting  of  two  lamellae:  the  one  having  the 
a- like  structure  and  the  other  having  the  7-like  structure.  A  routine  calculation 
gives  the  following  result  for  the  stresses  within  the  lamellae: 


2  PPH 

?  122  j 

^  PH  +  PH 
11  22 

2  PPH 
7?  1  2  1 

PH  +  PH 
11  22 


/  \  2  PPH  ,  , 

l  a7  -  A7  |  6a/3  +  - ll±—  (  a<q/?>  _  A(q/?)  )[2.23) 

\  27  j  7  PH  +  PH  \  2  1  /  ' 


A7  -  A7 
2  1 


PH  +  PH  V 
11  22 

2  PPH 
121 

PH  +  PH 
11  22 


2 


A^®) 

1 


where  the  constant  R  is  defined  as 


Rp  = 


PH  (l+Sc)  C  +  PH  (l  +  3c)  C 
u_V  1/2  22V  2/1 _ 

PH  ( 1  +  3  Cj  (l+  cj  +  PH  (l  +  3  Cj  (l+  Cj  ’ 


1-2  v 


(2.24) 
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The  in-plane  deformations  within  the  lamellae  are  equal  to 

M 

k(q/3)  =  rJc-c)( A3,  -  8aP  +  — —  ( A^  -  A^\2.25) 

i  V 2  iyi2  1 7  RH  +  RHK  2  i  /  ' 

v  7  112  2 


-  -*•(?-?)(■ 


A1  -  AX )  <5a/3  - 


•7  -7 

2  1 


fJL 

LI _  ( 

+  rh  \  2  i  ; 


RH  +  RE 
11  2  2 


where 


Rk  = 


RRHH 
12  12 


w(1  +  3c)(1+c)+  w(l  +  3c)(i+c) 

For  the  out-plane  deformation  components  733)  we  get  the  formulae 


(2.26) 


(2.27) 


1  +  3  C)  CRH 
2  /  12  2 


kW  = 


m(1+3c)(i+c)+  W  (l+3c)(l+c) 

_ (1+3?)??? _ | 

ff(1+3?)(1+?)+;?  (1+3?)(1+?) 


AX  -  AX 


AX  -  AX 
.  2  -  1 


All  other  components  of  the  deformation  appear  to  be  zero. 

If  the  Poisson  ratios  of  the  phases  are  equal  -  v  =  v  =  u  -  the  formulae  2.24 

-  2.27  allow  some  simplifications;  in  particular,  we  get  the  following  formula  of  R 
in  this  case: 


Rp  = 


1  -  u 


(2.28) 


If,  in  addition,  the  shear  moduli  are  equal  -  R=H=  a  -  the  formulae  2.23  of  the 

i  i 

stresses  pa/3  reads: 

M 


nQ0  — 


2  ii  H 
2 


1  H  +  H  1 -v \ 
1  2 

P&P  — _ 1 _ 

2  H  +  H  1-V 
1  2 


AX  -  AX  I  6a0  + 


2  [iH 
2 

H  +  H 
1  2 


(2.29) 


\  2  fiH  , 

AX  -  AX  <5a/3 - (  A(q/3)  -  A(q/3) 

2'  1 7  H  +  H  V  2  1 
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The  formulae  2.29  appear  to  be  in  agreement  with  the  formulae  2.23  as  they 
should. 


2.5.2.  a/7  -  periodic  structure  (N-layers,  anisotropic  elastic  lamellae) 

The  analysis  of  this  most  general  case  is  based  on  the  system  2.14  -  2.16.  It  is 
convenient  to  introduce  the  following  matrices: 


L  K=N  La/3  K=N  ( 

T«j  =  Hca™s  ,  R  =  £  He* W  A  -  A 

KK  L-fSnuJ  KK  \K(lS)  l  (yS) 


(2.30) 


where  s  are  the  ” partial  1-2”  part  of  the  compliance  tensor  s  ;  ca/?7<5is  the 

•^7^7r<j  Lijkl  L 

inverse  of  s  ,  i.e.,  the  tensor  satisfying  the  relationship 

LySiUjj 


ca^ss  =  6Z6? 

L  (  U,) 


(2.31) 


In  terms  of  these  ’’material”  tensors  the  stress  tensor  within  the  L-th  lamella 

PaP  can  be  calculated  with  the  of  the  following  linear  system  of  equations: 

L 


L  L  a/3 

TaP  R 


(2.32) 


We  remind  the  reader  that  in  the  equations  2.30  -  2.32  the  indices  rim  the 
values  1  and  2  only  (not  3!). 

The  above  program  can  be  fulfilled  explicitly  in  the  case  of  isotropic  lamellae. 
Corresponding  calculations  give  the  following  formula  of  the  stresses  within  the 
L-th  lamella  of  the  spatial  period  including  N  lamellae: 

(2.33) 

n  [2  vHT  ~  fl-  v )  VF  F  1  /  \ 

P"s  =  f '  H  V  ( A<’">  -  A<’") ) 

1  jifci  "  (l-  A  2  T  (T  +  f)  T  '  1  \m  l  ) 


where  T  and  F  are  defined  as  follows: 

L  L 


-1  N  F  N  F 

t  =  oE¥tt.  f  =  Eirf 


V  —  V 
K  L 

A  (1- 


(2.34) 
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2.6.  Misfit  stresses  in  the  ” on-substrate”  periodic  multilayers  with  dif¬ 
ferent  elastic  moduli 

The  case  of  the  ”  on-substrate”  multilayer  is  of  the  primary  interest  in  many  ap¬ 
plications,  especially,  those  related  to  the  deposition  of  thin  films,  though  they 
are  of  a  secondary  interest  in  concerns  the  TiAl  lamellar  structure,  and  we  pay 
much  less  attention  to  it  in  this  report.  In  the  ” on-substrate”  case  the  extremely 
thick  substrate  does  not  deform  practically  and  it  can  be  treated  as  if  it  is  rigid. 
Because  of  this  fact  all  the  monolayers  of  the  multilayer  are  ’’aware”  of  pres¬ 
ence  of  the  substrate  only,  and  they  ’’know”  nothing  about  the  existence  of  other 
monolayers.  Therefore,  we  can  rewrite  the  system  2.14,  2.15 

«  =  -  Am)  (2.35) 

M(a0)  AT  '  V  ' 


,  ijkl 


M 


K 


M(Jd) 


rii  =  -  cijkl a 

J  M 


71a 


M(kl) 


(2.36) 


where  if  the  ’’misfit”  with  respect  to  the  lattice  of  the  substrate  A  .  The  system 

M  (hi) 

2.35,  2.36  appears  to  be  closed  with  respect  to  the  unknown  misfit  deformations 
M(u)  ^  sh°ws>  in  particular,  that  the  stresses  do  not  depend  of  the  lamellae’s 

thicknesses.  In  the  case  of  isotropic  lamellae  we  arrive  at  the  following  misfit 
deformations  and  stresses: 


K  =0, 

M(a3) 

K  =  "M  A“„ 

M(33)  1  —  I'm  m 

(2.37) 

P  =  —2 hm  i 

A^(a/3) 

{vm  A\  5a0  +  A(Q(9)j 

(2.38) 

3.  Role  of  vacancies  in  slow  evolution  of  interfaces 

The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of  the 
product  of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy  asso¬ 
ciated  with  these  stresses  is  enormously  large  also.  Hence,  there  appears  a  driving 
force  for  the  rearrangement  of  the  species  leading  to  the  evolution  accompanied 
by  the  creation  of  dislocations,  migration  of  vacancies,  development  of  interfacial 
corrugations,  etc...  The  goal  of  this  Section  is  to  develop  a  thermodynamic  ap¬ 
proach  allowing  one  to  study  the  possibility  of  stress  relaxation  by  means  of  mass 
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rearrangement  in  the  vicinity  of  the  interface  between  the  lamellae.  We  assume 
that  the  rearrangement  is  accompanied  by  interfacial  diffusion  and  migration  of 
the  vacancies.  Since  the  full  consideration  of  the  coherent  interface  solid-solid 
includes  lots  of  calculation  and  details  we  illustrate  our  results  and  approach  by 
considering  the  much  simpler  but  conceptually  a  very  close  problem  of  mass  re¬ 
arrangement  in  the  vicinity  of  a  stress-free  interface. 

Mass  transport  through  the  bulk  of  solids  is  produced  by  the  migration  of 
vacancies  whereas  near  surfaces  the  migration  of  vacancies  is  accompanied  by 
surface  diffusion,  vaporization-sublimation,  etc.  (see,  for  instance,  Christian  [5]). 
The  simplest  and  most  robust  features  of  the  mechanism  of  surface  diffusion  has 
been  studied  by  Mullins  [15].  It  is  widely  accepted  that  this  mechanism  dominates 
over  migration  of  vacancies  near  traction-free  surfaces.  The  situation  can  change 
dramatically  at  the  nano-scale  (see,  for  instance,  Lee  and  Bennet  [14]).  At  this 
level  the  very  applicability  of  common  thermodynamics  seems  questionable.  Since 
at  the  nano-scale  it  is  difficult  to  distinguish  between  the  bulk  and  the  surface  part 
of  the  solid  it  seems  natural,  at  least,  to  withdraw  existing  macro-  and  micro-scale 
’’obvious”  pre-assumptions  and  to  explore  the  mechanism  of  interaction  between 
surface  diffusion  and  vacancy  migration. 

In  this  note  three  dissipation  mechanisms  are  taken  into  account:  i)dissip 
ation  from  the  flow  of  vacancies  at  the  surface,  ii) dissipation  from  bulk  diffusion 
of  vacancies,  iii)dissipation  from  surface  diffusion.  In  addition  to  the  bulk  energy 
4>  the  Herring-like  surface  energy  [9]  is  taken  into  account.  We  consider  a  slightly 
corrugated  solid  film  of  thickness  H  attached  to  a  rigid  substrate,  see  Figure.  It  is 
assumed  that  the  vacancies  can  penetrate  the  bulk  through  the  free  interface  E  of 
the  film  only.  We  denote  the  time  by  t,  the  spatial  coordinates  by  xk  (the  spatial  - 
Latin  -  indices  run  1,2,3,  and  summation  over  repeated  indices  is  implied).  x(xfc,  t) 
is  the  volume  concentration  of  vacancies  and  Ql(xk,  t)  is  the  vector  of  the  volume 
flux  of  vacancies.  We  denote  the  surface  coordinates  of  the  free  surfaces  by  ^  (the 
surface  -  Greek  -  indices  run  1,2);  and  qa((p,  t)  are  the  normal  velocity  of 

the  free  surface  and  the  surface  flux  of  the  species,  respectively.  Simple  kinematics 
(Mullins,  1957  [15];  Christian,  1975  [5];  Grinfeld,  1994  [7];  Berdichevsky  et  al. 
1996  [2])  leads  to  the  following  equations  of  mass  conservation  within  the  bulk 


dx  =  dg_ 

dt  dxl 


(3.1) 


and  at  the  free  surface 
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Qna 

c(l  -  x)  =  Q'n,  +  g|j  (3.2) 

where  rii(^,t)  is  a  unit  normal  to  the  surface.  The  first  term  on  the  right  hand 
side  of  Eq.  3.2  describes  the  migration  of  the  free  surface  from  the  flow  of  vacancies 
through  the  surface,  while  the  second  describes  the  migration  of  the  surface  from 
the  surface  diffusion. 

In  this  ’’kinematic  surrogate”  of  a  complete  thermodynamic  theory  we  ignore 
the  role  of  stresses  and  heat  and  suppose  that  the  energy  per  unit  volume  0  is  a 
function  of  the  vacancy  concentration  x,  whereas  the  surface  energy  per  unit  area 
is  a  positive  constant  a.  The  simplest  ” constitutive”  equations  of  the  volume  and 
surface  fluxes  compatible  with  the  requirement  of  energy  dissipation  read: 


(3.3) 

dZ0  i-x 

(3.4) 

where  D ^  and  Dafj  are  the  bulk  and  surface  positive  symmetric  diffusion  tensors, 
respectively;  k  is  a  doubled  mean  curvature  of  the  surface.  The  simplest  equation 
to  describe  the  flow  of  vacancies  Qln{  through  the  free  surface  compatible  with 
the  principle  of  energy  dissipation  reads 


+  (I  -  x)<f>x  - 

i  -  X 


(3.5) 


where  Dj  is  a  positive  constant  characterizing  the  resistance  of  the  solid  substance 
to  penetrating  of  vacancies;  <f>x  =  d(f)/dx-  For  the  sake  of  simplicity  we  assume 
that  no  vacancies  flow  through  the  film-substrate  interface  E: 


Q^i  =  0  (3.6) 

The  equations  3.1  -  3.6  form  a  closed  self-consistent  nonlinear  system  of  the 
differential  equations  and  boundary  conditions  for  describing  slow  evolution  of 
deformable  elastic  solids  with  vacancies. 

We  denote  by  x°  the  equilibrium  concentration  of  vacancies  within  the  film 
when  it  has  a  plane  free  surface  and  is  exposed  to  vacuum.  Hence,  according  to 
Eq.  3.5,  x°  satisfies  the  equation 

<Kx°)  +  (1  -  X°)<f>x(x°)  =  0  (3.7) 
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Let  us  assume  now  that  the  free  surface  E  is  slightly  corrugated.  The  evolution 
of  this  corrugation  can  be  explored  by  linearizing  the  equations  3.1  -  3.7  in  the 
vicinity  of  the  uniform  configuration  with  further  consideration  of  the  solutions 
of  the  form  ~  exp (rjt  +  ika£a)]  here  £Q  are  the  Cartesian  coordinates  on  the 
equilibrium  plane  surface,  ka  is  the  in-plane  wave-number  of  the  corrugation,  rj  is 
the  growth-decay  rate  of  the  corrugations. 

In  the  following  we  assume  that  the  film  is  isotropic  and  contains  a  small 
concentration  of  vacancies  y°  <C  1.  Hence,  the  diffusion  tensors  take  on  the  form: 
Dl°  =  Da@  =  DsSal3,  where  Db  and  Ds  are  positive  constants  whereas  <5tJ 

and  Sa/3  are  the  spatial  and  surface  Kronecker  symbols.  A  routine  computation 
results  in  the  following  dispersion  equation  for  this  case: 

+  ¥l^j+D, 

where  N  =  A2</;(x°)/<9y2. 

Eq.  3.8  shows  that  the  real  r]  are  necessarily  negative  as  they  should  be  in  the 
absence  of  destabilizing  factors. 

Iir]/(k2NDb)  «  1  we  can  derive  from  Eq.  3.8  the  following  dispersion  relation: 

3  DsDbk 2  tanh  ( kH )  +  DiDsk  +  DbDi 
V  =  ~Gk  Dbk  tanh  {kH)  +  Dr 

Eq.  3.9  shows  clearly  that  the  three  dissipation  mechanisms  are  not  additive 
but  interact  with  each  other.  It  is  quite  instructive  to  extract  from  Eq.  3.9 
Mullins’  A;4- term  by  rewriting  the  equation  in  the  form 


(r]  +  Dsak*)  +  DbDiak 3  =  0  (3.8) 


t]  =  —DscrkA 


_ 1 

tanh  (kH)  + 


(3.10) 


The  second  terms  in  the  brackets  gives  a  correction  to  the  ”fc4-law”  which  might 
even  dominate,  depending  on  the  magnitudes  of  the  nondimensional  numbers 
Dsk2/Dj,  Dsk/Db,  kH.  First  of  all  the  assumption  r)/{k2NDb )  1  gives  for  the 
Mullins  mode  the  following  short  wave-length  limitation:  Dsak2 /NDb  1.  The 
correction  term  approaches  zero  at  Dsk2/Dj  1  or  Dsk/Db  1.  For  thin  films 
the  asymptote  kH  — *  0  is  often  of  interest.  In  this  limit  Eq.  3.10  reads 

,  =  -D,ak‘  (l  +  (3.11) 
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and  for  sufficiently  long  wave  corrugations  with  k~x  >  Ds/Db  the  correction  term 
is  comparable  with  the  Mullins  k4- term.  The  A:3-term  appears  to  dominate  as 
k  — »  0  and  in  this  limit  the  rate  r)  approaches  —Dbak3.  The  k3  variation  also 
holds  for  thick  films  in  the  case  of  ’’instantaneous  kinetics”  (very  large  values  of 
Di).  We  notice  that  the  same  dispersion  equation  rj  oc  A;3  is  valid  for  the  transport 
mechanism  ’’vaporization-condensation”  which  is  presumably  more  effective  than 
the  vacancy  transport  at  macro-  and  micro-scale  level  but  the  situation  could  be 
opposite  at  the  nano-scale  level  because  the  values  of  Ds,  Db.  and  Dj  need  not 
be  the  same  as  they  are  in  macroscopic  bodies. 

It  is  well-known  that  stresses  within  the  solid  can  destabilize  flat  interfaces 
when  rearrangement  of  mass  is  allowed.  Assuming  that  the  film  is  isotropic  and 
it  is  sufficiently  thick  we  arrive  the  following  generalization  of  Eq.  3.8: 


(■ Dtk{1  +  p™;  +  A) *  +  P'12) 

(DkDi  +  DhDsk2)k2 +  DiDsk3  ak-(  l-v)—  =0 

.  L  . 

where  v  is  the  Poisson  ratio,  T  is  the  uniaxial  stress.  Eq.  3.12  can  be  presented 
in  the  following  dimensionless  form: 

(Vl  +  R  +  Kib)R  +  [y/l  +  R(Kib  +  Ksb)  +  KibKsb\  [1  -  (1  -  i/)$a]  f  =  0  (3.13) 
where  the  dimensionless  numbers  R,  Kiin  Ksb,  <3>a,  T  are  defined  as  follows: 


d  _  ^  is  _  ts  _  Dsk  _  _  T2  _  crk 

k2NDb  ’  _  Dbk ’  “  Db  ’  ®  auk ’  T~  N 


At  R  <C  1  we  get  the  following  nondimensional  dispersion  equation: 

_  Rib  -fi  K<:h  RfbKsb  r  /  .  _ 

r  =  f  [1  -  (1  -  i/)$a]  T 

1  +  Rib 


(3.14) 

(3.15) 


which  gives  a  consistent  generalization  of  the  dispersion  equation  of  Asaro  and 
Tiller  (1972)  [1],  compatible  also  with  the  static  stability  criteria  (see,  for  in¬ 
stance,  Grinfeld,  1991  [6];  Nozieres,  1991  [16])  though  it  differs  from  the  criterion 
of  Vasudev,  Asaro  and  Tiller  (1975)  [18]exploring  the  same  mechanism  of  rear¬ 
rangement  via  the  bulk  diffusion  in  pre-stressed  crystal.  The  possibility  of  stress 
driven  ’’necking” of  a  rod  caused  by  the  migration  of  vacancies  has  been  noticed 
by  Larche  and  Cahn  (1985)  [12], 
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4.  Conclusion 


4.1.  The  main  results  of  this  project. 

The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of  the 
product  of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy  asso¬ 
ciated  with  these  stresses  is  enormously  large  also.  Hence,  there  appears  a  driving 
force  for  the  rearrangement  of  the  material  elements  leading  to  the  evolution  ac¬ 
companied  by  the  creation  of  dislocations,  migration  of  vacancies,  development  of 
interfacial  corrugations,  etc... 

The  goal  of  this  research  project  was  two-fold:  i)to  develop  a  theoretical  ap¬ 
proach  which  permits  a  self-consistence  calculations  of  misfit  stresses  in  TiAl 
lamellar  multilayers  and  like  structures,  and  ii)to  develop  a  thermodynamic  ap¬ 
proach  allowing  one  to  study  the  possibility  of  stress  relaxation  by  means  of  mass 
rearrangement  in  the  vicinity  of  the  interface  between  the  lamellae. 

We  have  developed  a  general  scheme  and  establish  several  simple  general 
formulae  permitting  to  evaluate  numerically  and  to  analyze  qualitatively  mis¬ 
fit  stresses  in  TiAl  multilayers.  On  the  basis  of  the  model  of  coherent  interfaces 
we  arrive  to  the  conclusion  that  one  have  to  carefully  distinguish  between  the  ’’on- 
substrate”  misfit  stresses,  on  the  one  hand,  and  the  ”free-edge”  misfit  stresses, 
on  the  other  case.  The  former  case  is  more  relevant  for  the  stresses  appearing  at 
different  processes  of  deposition:  the  misfit  deformations  and  strains- in  the  ’’on- 
substrate”  multilayers  can  be  calculated  with  the  help  of  the  master  system  2.35, 
2.36  (for  the  general  anisotropic  case)  and  simple  formulae  2.37,  2.38  for  the  case 
of  isotropic  monolayers.  The  ”free-edge”  misfit  deformations  and  stresses  can  be 
calculated  with  the  help  of  the  master  system  of  algebraic  equations  2.14  -  2.16 
for  the  arbitrary  anisotropic  lamellae  with  different  moduli.  Much  more  simple 
working  formulae  can  be  used  in  the  case  of  isotropic  lamellae: 

i  -  2.21,  2.22  for  the  case  of  equal  elastic  moduli  (N-layer  periodic  multilayer  in 

width), 

ii  -  2.23-2.29  for  the  case  of  different  elastic  moduli  (2-layer  period  multilayer  in 

width), 

iii  -  2.33,  2.34  for  the  case  of  different  elastic  moduli  (N-layer  periodic  multilayer 

in  width). 

When  studying  possible  mechanisms  of  stress  relaxation  we  assumed  that  the 
rearrangement  is  accompanied  by  interfacial  diffusion  and  migration  of  the  vacan- 
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cies.  Since  the  full  consideration  of  the  coherent  interface  solid-solid  includes  lots 
of  calculation  and  details  we  illustrate  our  results  and  approach  by  considering 
the  much  simpler  but  conceptually  a  very  close  problem  of  mass  rearrangement  in 
the  vicinity  of  a  stress-free  interface.  Using  one  of  the  approaches  of  irreversible 
thermodynamics  of  solids  we  explored  the  slow  evolution  of  corrugations  at  the 
free  boundary  of  a  crystalline  substance.  The  central  issues  of  this  study  are  the 
roles  and  the  interaction  of  the  three  dissipation  mechanisms:  i)the  surface  diffu¬ 
sion  of  the  species,  ii)the  bulk  diffusion  of  vacancies,  and  iii)the  flow  of  vacancies 
within  the  solid  from  vacuum.  For  the  flux  of  vacancies  from  vacuum  we  have 
postulated  the  constitutive  equation  3.5  which  can  be  easily  justified  on  the  basis 
of  Onsager  ideology  (this  means,  basically,  that  the  constitutive  law  for  the  flux 
should  obey  the  linearity  between  the  flux  and  corresponding  driving  force,  and 
it  has  to  provide  the  decay  of  the  total  accumulated  energy).  The  new  dissipa¬ 
tive  material  constant  Dj  appearing  in  this  modelling  should  be  determined  by 
quantitative  measurement  of  different  effects  relating  to  the  penetration  of  vacan¬ 
cies  within  the  solid  from  vacuum  (which  is  nothing  else  but  rearrangement  of 
the  species  accompanied  by  producing  new  vacancies  in  the  vicinity  of  the  free 
surface).  One  of  the  effects  of  this  sort  is  the  evolution  of  surface  corrugations: 
the  growth-decay  rate  of  the  corrugations  depends,  besides  other  parameters,  on 
the  value  of  D,.  We  have  derived  a  rather  simple  dispersion  equation  3.8  of  the 
growth  rate  of  the  surface  corrugation  T]  with  the  help  of  the  master  system  of 
slow  evolution  (Eqs.  3.1  -  3.6)  using  the  simplest  model  of  the  isotropic  elastic 
solid  with  vacancies  (Lifshitz,  1962  [13]).  The  equation  3.8  and  its  implications 

3.9  -  3.11  allow  one  to  find  those  corrugations  for  which  the  Mullins’  k 4  -  law  gives 
acceptable  approximation  and  those  for  which  it  does  not.  The  equations  3.8  - 

3.10  clearly  show  that  the  three  above  mentioned  dissipative  mechanism  cannot 
be  described  as  the  linear  superposition  even  for  small  disturbances  (as  they  are 
treated  usually  in  the  existing  literature). 

As  the  side  result  of  this  study  we  have  derived  the  dispersion  equation  3.12  of 
the  growth  rate  of  surface  corrugations  in  the  pre-stressed  crystal.  Our  dispersion 
equation  appears  to  be  consistent  with  the  known  thermodynamic  criterion  of  the 
stress  driven  rearrangement  instability  as  it  should. 

4.2.  The  most  promising  directions  of  future  research 

The  result  of  this  project  demands  further  studies  in  what  concerns  both  theory 
and  experiment.  The  following  directions  of  theoretical  studies  look  the  most 
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urgent  and  promising: 

I  Misfit  stresses  in  the  case  of  large  misfit  deformations  (actually,  only  the 
”in-plane”  misfit  deformations  can  be  supposed  small  geometrically  -  of  the 
order  of  several  %  -  in  the  first  approximation;  the  shear  deformations  are 
actually  extremely  big  of  the  order  of  hundred  %%). 

II  Misfit  stresses  in  the  plate-like  random  structure 

III  The  edge  effects 

•  The  edge  effects  in  vicinity  of  free  surface 

•  The  edge  effects  in  vicinity  of  intersection  of  two  free  surfaces 

•  The  edge  effects  at  the  corners 

IV  The  grain  structure  of  the  lamellae 

•  An  isolated  grain  within  a  lamella 

•  An  equilibrium  shape  of  an  isolated  grain  within  a  lamella 

•  Stability  of  an  isolated  grain  within  a  lamella 

•  A  periodic  ensemble  of  the  grains  within  the  isolated  lamella 

V  Effective  elastic  moduli  of  laminated  elastic  structures  with  large  internal 
stresses 

•  Effective  elastic  moduli  in  the  7/7  -  periodic  structure  (equal  elastic  moduli) 

•  Effective  elastic  moduli  in  the  a/7  -  periodic  structure  (different  elastic 
moduli) 

VI  The  non-elastic  effects 

•  Relaxation  of  misfit  stresses  via  the  mechanism  of  the  stress  driven  rear¬ 
rangement  instability  at  the  coherent  interfaces 

•  Relaxation  of  misfit  stresses  via  proliferation  of  misfit  dislocations 

•  Nucleation  of  misfit  dislocations  via  the  mechanism  of  the  stress  driven 
rearrangement  instability 
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